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The gene cluster involved in aclacinomycin biosynthesis, and its use for genetic 



engineering 



Field of the invention 



This invention relates to the gene cluster for aclacinomycin biosynthesis derived from 
Streptomyces galilaeus, and the use of the genes included therein to obtain hybrid anti- 
biotics, or to increase yields of aclacinomycins or related antibiotics. 

10 Background of the invention 

Anthracyclines are widely used anticancer agents. Seven different anthracyclines are in 
worldwide clinical use: daunorubicin, doxorubicin, idarubicin, epirubicin, pirarubicin, 
zorubicin and aclarubicin. A representative compound is doxorubicin, being the most 

1 5 efficient and acting on a wide array of malignancies. A variety of toxic effects, like 

cumulative cardiotoxicity found with doxorubicin has sometimes led to discontinuation of 
the treatment. Furthermore, there are some type of malignancies which do not respond to 
available anthracyclines. The mechanism of action of anthracyclines, reflecting to their 
clinical efficiencies, is not clear, although most researchers consider inhibition of topo- 

20 isomerase II as a desired effect. Generation of free radicals derived from quinonic structures 
is suggested to be related to side effects such as cardiotoxicity. Anthracyclines have recently 
been reviewed by Professor Strohl and his group (1997). 

Aclacinomycin A (aclarubicin) first described by Oki et al (1975) is an anthracycline anti- 
25 biotic produced by Streptomyces galilaeus ATCC 31133 and galilaeus ATCC 3 1615. It 
is active against tumor cells and exhibits alleviated toxic properties as compared with doxo- 
rubicin. However, its activity does not reach solid tumors, limiting its use in leukemia 
treatment. Aclarubicin differs from the other counterparts in its structure. A trisaccharide 
moiety, rhodosamine-2-deoxyfucose-cinerulose A is attached at C-7 by a glycosidic bond, 
30 whereas at the corresponding position of daunomycins only one sugar residue, daunos- 
amine, is attached. 
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Despite the long history of anthracyclines, three decades or so, the studies on their bio- 
synthesis are still going on, and there is further interest to obtain novel molecules for the 
development of cancer chemotherapeutics. A method currently used for finding novel 
molecules for drug screening is genetic engineering. Cloning the genes for anthracycline 
5 biosynthesis facilitates the production of hybrid anthracyclines, as well as their use in 

combinatorial biosynthesis to generate novel molecules. As regards the chemical nature of 
anthracyclines currently in clinical use, aclarubicin has unique features which make its 
biosynthetic genes interesting in creating novel products. 

10 Regarding the genes for deoxyhexose pathway, Madduri et al (1 998) have reported that a 
gene derived from avermectin biosynthesis cluster caused the production of hybrid anthra- 
cyclines altering a sugar moiety when transferred into a S. peucetius strain. The product 
obtained was epirubicin, a commercially important anthracycline. In this case a hydroxy 
group in the daunosamine moiety was in the opposite stereochemistry due to the action of 

1 5 an avermectin biosynthesis gene. 

S. galilaeus has been used as the host to prepare hybrid anthracyclines using the genes 
derived from rhodomycin pathway from S, purpurascens (Niemi ei al, 1994) and from 
nogalamycin biosynthesis cluster from S. nogalater (Ylihonko et al, 1996a). The genes for 

20 nogalamycin pathway were used to generate the hybrid anthracycline production in S. 

steffisburgensis producing typically steffimycin (Kunnari et al, 1997). Previously, biosynt- 
hesis genes for actinorhodin have been expressed in S, galilaeus, resulting in the formation 
of aloesaponarin (Strohl et al, 1991). These hybrid compounds were modified in the 
aglycone moiety. Recently, the biosynthesis genes involved in deoxyhexose pathway of 

25 nogalamycin were used to generate hybrid compounds using the S. galilaeus mutants as 
hosts (FI pat. appln No. 982295). 

As shown above, 5. galilaeus has been used as a cloning host to generate novel molecules, 
whereas its use to donate the genes has not been described. The identified genes involved in 
30 aclacinomycin biosynthesis include polyketide reductase gene (Tsukamoto et al, 1994), 
aklanonic acid methyl ester cyclase (GeneBank, ACCESSION AF043550) and genes for 
polyketide synthase (Hutchinson and Fujii, 1995; the sequence not available). 
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Summary of the invention 

The present invention concerns a gene cluster, most of the genes of which are derived from 
deoxyhexose pathway for rhodosamine, 2-deoxyfucose and/or rhodmose. The gene cluster 
5 was cloned from S. galilaeus ATCC 31615 and it is involved in biosynthesis of aclacino- 
mycins. 

Detailed description of the invention 

10 The experimental procedures of the present invention include biochemical and chemical 
methods conventional in the art. Detailed description of the techniques not explained here 
are given in the manuals by Hopwood et al 'Genetic manipulation of Streptomyces: a 
laboratory manual'. The John Innes Foundation, Norwich (1985) and by Sambrook et al. 
(1989) 'Molecular cloning: a laboratory manual'. 



The publications, patents and patent applications cited herein are given in the reference list 
in their entirety. 

The present invention concerns particularly the discovery of the gene cluster for aclacino- 
20 mycin biosynthesis. The cluster, when introduced into 5. peucetius strains caused the 
production of hybrid antibiotics modified in their sugar moiety. 

Several strategies may be adopted to clone genes for an antibiotic. Using E. coli as a host 
for a gene library, hybridization is the most advantageous screening strategy. The probe for 

25 hybridization may be any known fragment that shows sufficient homology to the bio- 
synthetic cluster for aclarubicin sugars, to be able to hybridize with said cluster. A DNA 
fragment which is identical to the desired region is preferred. Such a fragment, called Sg- 
dht, was obtained by PGR amplification of S. galilaeus chromosomal DNA, using de- 
generated oligonucleotides armeaJing to the conserved region of 4,6-dehydratase gene. 4,6- 

30 dehydratase is the first enzyme participating to a reaction series that converts a glucose 
molecule bound to a nucleotide into 6-deoxy sugars generally found in antibiotics. Using 
this probe it was possible to clone the cluster of deoxyhexose pathway fi"om a restricted 
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gene library. To simplify the cloning strategy the library was prepared in a pUC-based 
plasmid (e.g. pBluescript or pWHMl 109) replicating in E. coli. 

The strategy to clone the genes involved in aclacinomycin biosynthesis according to the 
5 invention was in brief: Total DNA was isolated from S. galilaeus (ATCC 31615) and 
digested with several restriction enzymes that yield fragments of 10 kb in average. 
Restriction fragments were analyzed by Southern hybridization using a homologous DNA 
fragment, Sg-dht, as a probe. Bghl gave a hybridized fragment of 8.5 kb, and a double 
digestion with Xhol and Notl gave a hybridized fragment of 7 kb. DNA digestion using (i) 

10 BglW and (ii) Xho\-Not\ was carried out and the fragments were ligated to the E. coli- 
Streptomyces shuttle vector, pWHMl 109, digested with BamHl and to the pBluescript 
digested with Xhol-Noil, respectively. The ligation mixtures were introduced into E. coli 
XLlBlueMRF' that exhibits alleviated restriction-modification systems. Colonies were 
plated on the agar plates in the dilution to give 200 to 600 cfu (colony forming units) per 

1 5 plate. Well grown colonies were transferred in nylon membranes for hybridization, which 
was carried out using the Sg-dht probe. Six out of the 786 5^/II-digested clones gave 
hybridization signal and 7 out of 1523 of those clones carrying ATzoI-TV^o/I fragments. 
Hybridization and washes were carried out in the stringent conditions of 65° C in a low salt 
concentration. Several techniques for the labeling of the probe and for hybridization are 

20 possible, but the procedure according to Boehringer Mannheim's "The DIG System User's 
Guide for Filter Hybridization" is preferred. The colonies giving hybridization signals were 
cultivated for plasmid isolation. The plasmids were analyzed by Southern hybridization to 
confirm the reliability of the colony hybridization. Plasmids containing the desired DNA 
fi-agments (Sg4 and Sg5) were designated as pSgc4 (5^/II-fragment) and pSgc5 {Xho\-Noi\ 

25 fi*agment)(see Fig. 2). 

The fragments, Sg4 and Sg5, were subcloned for sequencing in^. coli vectors pUC19 and 
pBluescript. In total 30 subclones were used to obtain the nucleotide sequence of Sg4 and 
Sg5. The sequenced cluster revealed thirteen genes involved in biosynthesis of aclacino- 
30 mycins. Comparison with the sequences found in the sequence library suggested the 

functions as sgal for an activator, sgdi for a dehydratase, sga^ for oxidoreductase, sga5 
for dTDP-glucose 4,6-dehydratase, sga6 for glycosyl transferase (GTF), sgal for a putative 
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isomerase, sga8 for aklaviketone reductase, sga9 for a putative polyketide assembler, sga]0 
for a putative cyclase, sgal 1 for aminomethylase, sgall for glucose- 1 -phosphate thymidylyl 
transferase, sga\3 for aminotransferase. The function of sgal is not suggested based on 
similarity searches. Based on the deduced functions, nine genes are involved in glycosyla- 
5 tion pathway. The genes involved in the formation of aglycone are sgaS^ sga9^ and sgalO. 
The activator, Sga2, may control both the glycosylation system and the formation of akla- 
vinone via polyketide pathway. 



Sg4 derived from pSgc4 was cloned in the Streptomyces expression vector pIJE486 

10 (Ylihonko et al, 1996b) in S. lividans TK24 to give pSgs4. This vector is a high copy 
number plasmid that replicates in several Streptomyces spp. (Ward et aL, 1986) and it 
contains a constitutively expressed promoter, ermE (Bibb et al, 1985) upstream from the 
multiple cloning site. The plasmid pSgs4 isolated from TK24 was introduced into the S. 
galilaeus strains that are blocked in deoxyhexose pathway of aclacinomycin biosynthesis 

1 5 and into the S. peucetius mutants producing e-rhodomycinone based on a lesion in 
glycosylation genes. The ability of aclacinomycin production was restored by three S, 
galilaeus mutants, H063, H054 and H065. The mutant strain H063 accumulates aklavinone 
and it was completely complemented by the plasmid pSgs4. Instead, H054 and H065 
producing aklavinone glycosides sharing neutral sugars, but not rhodosamine, were only 

20 partially complemented by pSgs4. Surprisingly, H063 carrying pSgs4 (H063/pSgs4) was 
able to produce aclacinomycins two-fold to that of the wild type S. galilaeus, S. peucetius 
Ml 8 and M90 which produce e-rhodomycinone were selected to hosts for pSgs4. L- 
rhamnosyl-s-rhodomycinone (El Khamed et al, 1977) was obtained when pSgs4 was 
expressed in the mutants Ml 8 and M90 and, in addition, M18/pSgs4 produced L-daunos- 

25 aminyl-£-rhodomycinone (Essery and Doyle, 1 980). The structures were not new ones but 
this demonstrates the ability of the gene cluster according to the present invention to 
generate hybrid products in a heterologous host. To produce hybrid compounds we prefer 
to use El medium supplemented with a suitable antibiotic, in this case, thiostrepton, to 
maintain the selection pressure for the plasmid containing strains. The products were 

30 extracted by organic solvents and purified by chromatography to obtain the compounds in 
high purity for structural elucidation. 
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Examples to further illustrate the invention are given hereafter. 



Brief description of the drawings 



5 FIG. 1 shows the structures of aclacinomycin, daunomycin and £-rhodomycinone. 



FIG. 2 is a diagram of the gene cluster for aclacinomycin biosynthesis. 



FIG* 3 describes the proposed biosynthesis pathway for sugars found in aclacinomycins. 



FIG. 4 shows the structures of the hybrid compounds produced by M18/pSgs4 (1 and 2) 
and M90/pSgs4 (2). 

EXPERIMENTAL 

15 

Materials used 

Restriction enzymes used were purchased from Promega (Madison,Wisconsin, USA), 
Fermentas (Lithuania) or Boehringer Mannheim (Germany), alkaline phosphatase from 

20 Boehringer Mannheim, and used according to manufacturers' instructions. Proteinase K was 
purchased from Promega and lysozyme from Sigma. Hybond™-N nylon membranes used in 
hybridization were purchased from Amersham (Buckinghamshire, England), DIG DNA 
Labelling Kit and DIG Luminescent Detection Kit from Boehringer Marmheim. Qiaquick 
Gel Extraction Kit from Qiagen (Hilden, Germany) was used for isolating DNA from 

25 agarose. 

Bacterial strains and their use 



Escherichia coli XLlBlueMRF' (Stratagene, La Jolla, California) was used for cloning. 

30 



10 



Streptomyces lividans TK24 was the first cloning host for gene expression. The strain was 
provided by prof. Sir David Hopwood, John Innes Centre, UK. 
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The wild type, Streptomyces galilaeus ATCC 31615, produces aclacinomycins. It was used 
here to donate the genes of the invention. 



Streptomyces galilaeus H039 (Ylihonko et al, 1994) produces Akv-(Rho)o.3. It was used as 
5 an expression host for pSgs4 being more easily transformed than the other mutants or the 
wild type. 

Streptomyces galilaeus H054 (Ylihonko et al, 1994) produces Akv-Rho-dF-(CinA)o.i, 
Akv-dF-dF-(CinA)o_j and Akv-dF-Rho-Rho. It was used as an expression host for pSgs4. 

10 

Streptomyces galilaeus H063 produces aklavinone. It is a mutant strain derived from the 
wild type S. galilaeus. H063 was used as an expression host for pSgs4. 

Streptomyces galilaeus H065 produces aklavinone with neutral glycosides. It is a mutant 
1 5 strain derived from the wild type S. galilaeus, H065 was used as an expression host for 
pSgs4. 

Streptomyces peucetius Ml 8 and M90 producing £-rhodomycinone are the mutants derived 
from S, peucetius vsit, caesius (ATCC 27952). They were used as expression hosts for 
20 pSgs4. 

Plasmids 

E. coli cloning vectors pBluescript SK (Stratagene) and pUC19 (Pharmacia, Sweden) were 
25 used for making the subclones for sequencing and pBluescript was used also as a vector of 
a gene library. 

pWHMl 109 (provided by prof CR Hutchinson, Wisconsin, USA) is a shuttle vector 
replicating in E. coli and in streptomycetes. It was used as a vector of a gene library. 

30 

pIJ486 is a high copy plasmid vector provided by prof Sir David Hopwood, John Innes 
Centre, UK (Ward et al, 1986). 
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pIJE486 (Ylihonko et al, 1996b) is an expression vector containing erniE (Bibb et al, 
1985) to promote expression of the cloned genes. 

Nutrient media and solutions 

5 

For cultivation of S. galilaeus for total DNA isolation TSB medium was used. Lysozyme 
solution (0.3 M sucrose, 25 mM Tris, pH 8 and 25mM EDTA, pH 8) was used to isolate 
total DNA. TE buffer (10 mM Tris, pH 8.0 and ImM EDTA) was used to dissolve DNA. 



1 0 TRYPTONE-SOY A BROTH (TSB) 

Per litre: Oxoid Tryptone Soya Broth powder 30 g. 



ISP4 

Bacto ISP-medium 4, Difco; 37 g/1. 

15 

El Per litre in tap water: 



glucose 20 g 

soluble starch 20 g 

Farmamedia 5 g 

20 Yeast extract 2.5 g 

K2HPO4O3H2O 1.3 g 

MgSO4o7H20 1 g 

NaCl 3 g 

CaC03 3 g 



25 pH adjusted to 7.4 before autoclaving 



General methods: 



NMR data was collected with a JEOL JNM-GX 400 spectrometer. and '^C NMR 
30 samples were internally referenced to TMS. 

The anthracycline metabolites were determined by (i) HPLC (LaChrom, Merck Hitachi, 
pump L-7100, detector L-7400 and integrator D-7500) using a LiChroCART RP-18 
column. Acetonitrile:potassium hydrogen phosphate buffer (60 mM, pH 3 .0 adjusted with 
35 citric acid) was used as a mobile phase. Gradient system starting from 65 % to 30 % of 
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potassium dihydrogen phosphate buffer was used to separate the compounds. The flow rate 
was 1 ml/min and the detection was carried out at 480 nm, and (ii) by TLC using precoated 
Kieselgel 60 F254 glass plates (Merck, Darmstadt, Germany) with an elution solution of 
toluene:ethyl acetate: methanol: formic acid (50:50:15:3). 



1SP4 plates supplemented with thiostrepton (50 /ug/m\) were used to maintain the plasmid 
carrying cultures. 

Example 1, Cloning the gene cluster for aclacinomycin biosynthesis 



1.1 Selection of clones by hybridization 

For isolation of total DNA, Streptomyces galilaeus was grown for four days in 50 ml of 
TSB medium supplemented with 0.5% glycine. The cells were harvested by centrifliging for 
15 min (3900 x g) in 12 ml Falcon tubes, and stored at -20 °C. Cells from a 50 ml culture 

1 5 were used to isolate DNA. 5 ml of lysozyme solution containing 5 mg/ml of lysozyme was 
added on the cells of each Falcon tube, and incubated for 20 min at ST^'C. 500 |al of 10% 
SDS containing 0.7 mg of proteinase K was added on the cells, and incubated for 80 min at 
62 °C, another 500 ^1 of 10% SDS containing 0.7 mg of proteinase K was added, and 
incubation was continued for 60 min. The sample was chilled on ice and 600 |il of 3M 

20 NaAc, pH 5.8 was added, and the mixture was extracted with equilibrated phenol (Sigma). 
The phases were separated by centrifliging (1400 x g) for 10 min. The DNA was 
precipitated from the water phase with an equal volume of isopropanol and collected by 
spooling with a glass rod and washed by dipping into 70% ethanol, air dried and dissolved 
in 500 ^1 ofTE-buffer. 



Southern hybridization to determine suitable restriction enzymes for preparing the restricted 
plasmid libraries was carried out using Bglll, Xhol, Noil and their combinations. A fragment 
of about 9 kb hybridizing with the Sg-dht probe was preferred. For hybridization 600 ng of 
digested S. galilaeus DNA was loaded onto the agarose gel and after electrophoresis, the 
30 DNA was transferred from the gel to a nylon membrane by vacuum blotting. Hybridization 
was carried out according to Boehringer Mannheim's manual The DIG System User's 
Guide for Filter Hybridization'. The probe for hybridization, Sg-dht, which was used for 
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colony hybridization as well, was obtained by amplifying a gene fragment from the S. 
galilaeus DNA which is internal to the 4,6-dehydratase gene and corresponds to the 
fragment of 6345 to 6861 shown in SEQ ID NO: 14. PCR was used for amplification, and 
the sequences for the degenerated oligonucleotide primers were 5 -CSGGSGSSGCS- 
5 GGSTTCATSGG-3' (forward, SEQ. ID. NO: 15) and 5'-GGGWRCTGGYRSGGSCCG- 
TAGTTG-3' (reverse, SEQ. ID. NO: 16). Suitable fragments were a 9 kb Bglll fragment 
and a 7 kb Xhol-Notl fragment. 

Ten micrograms of the chromosomal DNA was digested with Bglll. The DNA fragments 
1 0 were separated by agarose gel electrophoresis and the band of 8 to 9 kb were cut from the 
0.6% low gelling temperature SeaPlaquet® agarose. The DNA band was isolated from the 
gel using Qiagen Gel Extraction Kit. The isolated fragment was ligated to pWHMl 109 
plasmid vector digested with^amHI and dephosphorylated, in the ratio of 3 moles of the 
insert DNA to 1 mole of the vector DNA. The ligated DNA was introduced into E. coli 
1 5 XLlBlueMRF' by electroporation. Using the whole ligation mixture 786 colonies were 

obtained. The colonies were grown on agar plates for at least 12 h and transferred to nylon 
membranes. Hybridization of colony membranes was carried out as Southern using Sg-dht 
as a probe. Six clones gave signal in hybridization and the corresponding colonies were 
plated on agar and inoculated in 3 ml of LB medium for isolation of the plasmid DNA. 
20 Southern hybridization was used to study whether the plasmids derived from the clones 
carried the desired insert. Four of these plasmids contained the 4,6-dehydratase gene 
fragment and gave the identical restriction map thus carrying the same fragment 
representing both orientations. The fragment was designated as Sg4 and the plasmid 
containing the fragment as pSgc4. 



In the same manner the plasmid library representing a 7 kb Xhol-Notl DNA fragment 
derived from S. galilaeus was constructed. pBluescript was digested with Xhol-Notl and 
the library containing the gene fragments of around 7 kb was constructed. In total 1523 
colonies were hybridized and seven turned to be the desired clone. As described above, the 
30 clones were studied for the Xhol-Notl fragment. The insert fragment was designated as Sg5 
and the plasmid as pSgc5. The strain E. coli XLlBlue MRFVpSgcS obtained was deposited 
according to the rules of the Budapest Treaty at Deutsche Samralung von Mikroorganismen 
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und Zellkulturen GmbH (DSMZ) on August 12, 1999 with the accession number DSM 
12999. The fragments Sg4 and Sg5 overlap within 836 bp corresponding bases from 6181 
to 7016 in SEQ IDNO:14. 

5 1.2. Subcloning the fragments for sequencing 

To determine the nucleotide sequence of the whole cluster of the Sg4 and Sg5 suitable 
subclones were constructed. The convenient restriction sites were used for subcloning the 
14806 bp region in the plasmids pUC19 and pBluescript. Nineteen subclones were needed 
to sequence Sg4, and 1 1 subclones for Sg5. 



E. coli XLlBlueMRF' cells containing the subcloned plasmids were cultivated overnight at 
37°C in 5 ml of LB-medium supplemented with 50 jxg/mi of ampicillin. To isolate plasmids 
for sequencing reactions Wizard Plus Minipreps DNA Purification System kit of Promega 
or Biometra Silica Spin Disc Plasmid DNA Miniprep kit of Biomedizinische Analytik Gmbh 
1 5 were used according to the manufacturers' instructions. 

DNA sequencing was performed using the automatic ABI DNA sequencer (Perkin-Elmer) 
according to the manufacturer's instructions. 

20 1.3 Sequence analysis and the deduced functions of the genes 

Sequence analyses were made using the GCG sequence analysis software package (Version 
8; Genetics Computer Group, Madison, Wis., USA). The translation table was modified to 
accept also GTG as a start codon. Codon usage was analyzed using published data (Wright 
and Bibb 1992). 



According to the CODONPREFERENCE program the sequenced DNA fragment revealed 
1 1 complete open reading fii-ames (ORFs), and two 5' ends of the other ORFs {sga\ and 
sga\3). The fianctions of the genes were concluded by comparing the amino acid sequences 
translated from their base sequences to the known sequences in the data banks. The results 
30 are shown in Table 1 referring to the sequence data given in the application. 
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The suggested functions for the genes match well with a proposed biosynthetic pathway of 
sugars of aciacinomycins (Fig. 3). The last residue in a trisaccharide moiety of aclacinomy- 
cins is rhodinose that is enzymatically converted to cinerulose. Aclacinomycin N, a 
precursor of aclarubicin, contains rhodinose as the third sugar residue. 



Table 1. 



Gene 


Position 


Amino 
acids 


Deduced function 


Remarks 


sga\ 


-1986 
compl 


>662 


unknown 


not complete 
Seq ID.NO:l 


sgal 


2523-3341 


272 


activator 


Seq.ID,NO:2 


sgal 


3355-4659 
comni 


434 


dehydratase 


Seq.ID.N0;3 


sga4 


4821-5810 


329 


oxidoreductase 


Seq.ID.NO:4 


sgaS 


5920-6891 
compl 


323 


dTDP -glucose 4,6-de- 
hydratase 


Seq.ID.NO:5 


sga6 


6879-8210 
compl 


443 


glycosyl transferase 
(GTF) 


Seq.ID.NO:6 


sgal 


8287-9618 
compl 


443 


putative isomerase 


Seq.ID.NO:7 


SgaS 


9642-10445 
compl 


267 


aklaviketone reductase 
(KRII) 


Seq.ID.NO:8 


sga9 


10471- 

10905 

compl 


144 


putative polyketide 
assembler 


Seq.ID.NO:9 


sgalO 


11115- 
11894 


259 


putative cyclase 


Seq.ID.NO:10 


sgal 1 


11956- 
12672 


238 


aminomethylase 


Seq.ID.NO:ll 


sgal 2 


12685- 

13560 

compl 


291 


glucose- 1 -phosphate 
thymidylyltransferase 


Seq.ID.NO:12 


sgal 3 


13783- 
14805 


341 


aminotransferase 


Seq.ID.NO:13 
not complete 
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1.4 Expression cloning in Streptomyces strains 

The 8 kb BamHl-HindlU fragment from pSgc4 was ligated in pIJE486 to give pSgs4. 
Plasmid pSgs4 was introduced into S. Uvidans TK24 by protoplast transformation. The 
strain S. Uvidans TK24/pSgs4 obtained was deposited according to the rules of the 
5 Budapest Treaty at Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH 
(DSMZ) on August 12, 1999 with the accession number DSM 12998. The plasmid pSgs4 
was isolated from the strain, and further transferred into S, galilaeus mutant H039. The 
plasmid preparate isolated from H039 was subsequently introduced into H063, H054, and 
H065 mutants deficient of glycosylation system of aclacinomycins. The usage of H039 as a 
1 0 primary S. galilaeus host was due to the better efficiency for the intake of foreign DN A. 

S, galilaeus mutants were studied for complementation by cultivating the clones containing 
pSgs4 in El medium supplemented with thiostrepton (10 yug/ml). The products from a 500 
(A sample of the culture broth were extracted with toluene:methanol (1 .T) at pH 1 , The 

1 5 metabolites from both the transformed clones and the mutants were analyzed by TLC and 
HPLC to find the differencies caused by pSgs4. H063 producing endogenously aklavinone 
was restored to aclacinomycin producer with pSgs4. No aklavinone was found in the 
culture broth of H063/pSgs4. However, complementation was not completed when pSgs4 
was expressed in H054 and H065. Both of the mutants produce aklavinone with neutral 

20 glycosides. Incomplete complementation was presumably due to the loss of the plasmids of 
some bacterial cells during cultivation, or a low expression of the genes needed as an 
activator is not present in pSgs4. 

In the same manner, pSgs4 isolated from TK24 was introduced into the S. peucetius 
25 mutants MIS and M90. The characteristic product for these mutants is e-rhodomycinone. 
The strains M18/pSgs4 and M90/pSgs4 containing the plasmid were cultivated in El 
medium supplemented with thiostrepton (10 yug/ml), and the metabolites therein were 
analyzed by TLC and HPLC. Both of the clones revealed an altered production profile as 
compared with the products obtained ft*om the mutants. M90/pSgs4 accumulated a 
30 glycosylated product, yielding e-rhodomycinone as the aglycone. The compound was 
identified as L-rhamnosyl-e-rhodomycinone which has been previously synthesized 
(CAS=63252-1 1-9) by El Khamed et al (1977). 
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M18/pSgs4 produced two compounds difTering from the parental strain. According to the 
HPLC and TLC data one compound was the same as was produced by M90/pSgs4, L- 
rhamnosyl-£-rhodomycinone, and the other one was L-daunosaminyl-e-rhodomycinone, 
5 which was previously characterized by Essery and Doyle (1980). 



Table 2: TLC and HPLC data of the hybrid products 



Product 


Rf-value 


Retention time 


£-rhodomycinone 


0.67 


6.70 


L-rhamnosyl-e-rhodomycinone 


0.38 


5.00 


L-daunosaminyl-8-rhodomycinone 


0.04 


4.06 



15 1.5 Applicability of pSgs4 for strain improvement 

Since H063 was completely complemented by pSgs4, the production level of aminogly- 
cosides was studied. For this purpose, H063/pSgs4, H063 and the wild type S. galilaeus 
were cultivated in El medium in the Erlenmeyer bottles for four days. Two samples of 2 ml 
from each culture were extracted first with toluene:methanol (1 : 1) in acidic conditions to 

20 remove the neutral glycosides and the aglycones. The extraction procedure was repeated 
until neutral glycosides and the aglycones had disappeared from the water phase. The 
amount of anthracycline metabolites in toluene phase was determined and is shown in Table 
3. Aclacinomycins containing rhodosamine were extracted from the water phase by 
chloroform. Both toluene and chloroform extracts were analyzed by TLC and toluene 

25 phases contained mostly aklavinone and the degradative products. Chloroform phases 
contained mainly aminoglycosides, although minor amounts of the aglycones were also 
detected. Extracts were evaporated to dryness and subsequently dissolved into 1 ml of 
methanol. The amounts of anthracycHne metabolites were detected by spectrophotometer at 
430 nm. The amounts related to absorbance were calculated using an extinction coefficient 

30 of 13000. The results given as mg/1 of cultivation broth are shown in Table 3. The 

production of aclacinomycins by H063/pSgs4 was at least twofold better than obtained by 
the wild type. 
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Table 3. 





Chloroform phase aminoglycoside 
fraction 


Toluene phase aglycone fraction 


Sample 


Absorbance 


Concentration 
(mg/l) 


Absorbance 


Concentration 
(mg/l) 


H063 


0.401 


12.6 


2.956 


92.3 


H063/pSgs4 


2.751 


85.9 


2.974 


92.9 


S. galilaeus 


1.338 


41.8 


0.690 


21.5 



1 0 The ability to increase the yield of aclacinomycins by pSgs4 in the mutant H063 suggests 
that the genes according to the present invention are useful in strain improvement. 

Example 2. Compounds generated by pSgs4 

1 5 The seed culture, 180 ml of El culture of the plasmid containing strains, Ml 8/pSgs4 or 
M90/pSgs4, was obtained by cultivating each of the strains in three 250 ml Erlenmeyer 
flasks containing 50 ml of El -medium supplemented with thiostrepton (5 |iig/ml) for four 
days at 30°C, 330 rpm. The combined culture broths (1 80 ml) were used to inoculate 13 1 
of El -medium in a fermentor (Biostat E). Fermentation was carried out for five days at 

20 28'^C (330 rpm, aeration: 450 1/min). 

The cells were harvested by centrifuging. 2.6 1 of methanol was used to brake the bacterial 
cells. The anthracycline metabolites were extracted fi-om methanol solution at pH 8 using 2 1 
of ethyl acetate and the extract was evaporated to dryness. The viscous residue was loaded 
25 onto a silica column of 4 x 10 cm and toluene:ethyl acetate: formic acid (50:50:3) with in- 
creasing amount of methanol was used as an eluent. Pure fractions were pooled and 
extracted with IM phosphate buffer (pH 8.0) and water. Organic phase was dried with 
anhydrous Na2S04 and then treated with hexane to effect precipitation. Pure compounds 
appeared as red powders dried under vacuum. 



30 
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Complete structural determination of the compounds were accomplished by NA4R. Proton 
and carbon assignments were based on a conventional NOE difference, pHSQC and HMBC 
measurements. Connectivities in particular relied heavily on HMBC experiment. 

As deduced from the data given in Table 4, the structures revealed were L-rhamnosyl-£- 
rhodomycinone (1) and L-daunosaminyl-s-rhodomycinone (2) shown in Figure 4. 

Although these structures were not novel, the generation of the hybrid products by the 
genes involved in glycosylation portion of aclacinomycin biosynthesis well demonstrates 
that the genes of pSgs4 are functional and ready to use in drug discovery for finding novel 
molecules. 

Deposited microorganisms 

The following microorganisms were deposited according to the Budapest Treaty at 
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ), Mascheroder 
Weg lb, D-38124 Braunschweig, Germany. 

Microorganism Accession number Date of deposit 



S. lividans TK24/pSgs4 DSM 12998 

E. coll XLlBlueMRFVpSgcS DSM 12999 



12 August 1999 
12 August 1999 
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Table 4. 'H and ^^C chemical shifts of 1 (DMSO^J and 2 (trace of TFA in DMSOaJ in 400 
and 100 MHz, respectively. 





1 


2 


Site 


'H 




'H 














1 


7.74, IH, dd, 7.5, 0.9 


118.9(d) 


7.74, IH, dd, 7.5, 1.0 


119.7(d) 


2 


7.64, IH, dd, 8.4, 
7.5 


136.5(d) 


7.68, IH, dd, 8.1, 7.5 


137.4(d) 


3 


7.22, IH, dd, 8.4, 0.9 


124.1(d) 


7.24, IH, dd, 8.1, 1.0 


125.0(d) 


4 




161. 8(s) 




162.6(s) 


4-OH 


12 00, IH, s 


- 


exchange broadened 


- 


4a 


- 


115. 2(s) 


- 


]15.9(s) 


5 


- 


189.9(s) 


- 


190,6(s) 


5a 


- 


110.4(s) 


- 


1 1 1 .4(s) 


6 


- 


156.2(s) 


- 


157.1(s) 


6-OH 


13.41, IH, s 


- 


exchange broadened 


- 


6a 


- 


135. l(s) 


- 


135. 7(s) 


7 


5.14, IH, d, 4.5 


70.9(d) 


5.15, IH, d, 3.6 


71.3(d) 


8A 


2.31, IH, d, 15.1 


28.9(t) 


2.33, IH, d, 14,6 


34.0(t) 


8B 


2.14, IH, dd, 15.1, 
4.5 




2.21, IH, dd, 14.6, 3.8 




9 




70.0(s) 


- 


70.9(s) 


10 


4.16, IH, s 


51.2(d) 


4.23, IH, s 


51.8(d) 


10a 


- 


134.8(s) 


- 


136.1(s) 


11 


- 


156.0(s) 


- 


156.8(s) 


1 1-OH 


12.77, IH, s 


- 


exchange broadened 


- 


11a 


- 


110.8(s) 


- 


lll.l(s) 


12 


- 


185.4(s) 


- 


186.0(s) 


12a 


- 


I32.6(s) 


- 


133.3(s) 


13A 


1.73, IH, dq, 13.9, 
7.4 


31.7(t) 


1.83, IH, dq, 14.1, 7.3 


32.0(t) 


13B 


1.38, IH, dq, 13.9, 
7.4 




\A7, IH, dq, 14.1, 7.3 




14 


1.05, 3H, t, 7.4 


6.09(q) 


1.13,3H, t, 7.3 


6.90(q) 


15 




170.4(s) 




171. l(s) 


16 


3.63, 3H, s 


51.7(q) 


3.70, 3H, s 


52.3(q) 


r 


5.28, IH, brs 


103.7(d) 


5.52, IH, d, 3.1 


100.7(d) 


2' 


3.83, IH, d, 5.2 


70.9(d) 


2.18, 2H, m 


27.1(t) 


3' 


3.44, IH, dd, 9.0, 5.2 


70.8(d) 


3.40, IH, dd, 11.8, 5.1 


55.5(d) 


4' 


3.41, IH, dd, 9.1, 9.0 


72.0(d) 


3.98, IH, brs 


67.0(d) 


5' 


3.77. IH, dq, 9.1,6.2 


68.9(d) 


4.21, IH, q, 6.3 


65.3(d) 


6' 


1.29, 3H, d, 6.2 


16.9(q) 


1.32, 3H, t, 6.3 


16.7(q) 
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Claims 

1 . An isolated and purified DNA fragment, which is the gene cluster for the anthracycline 
biosynthetic pathway of the bacterium Streptomyces galilaeus, being included in a 7 kb 

5 Xho\-Not\ fragment and a flanked 8.5 kb Bglll fragment of S. galilaeus genome. 

2. The DNA fragment according to claim 1, which comprises the nucleotide sequence given 
in SEQ ID NO: 14, or a part thereof having similar characteristics, or a sequence showing at 
least 84 % homology to said sequence. 

10 

3. A recombinant DNA, which comprises the DNA fragment of claim 1 or 2, or a part 
thereof having similar characteristics, cloned in the plasmid replicating in Streptomyces or in 
E. coll. 

15 4. The recombinant DNA according to claim 3, which is the plasmid pSgs4 deposited in S. 
lividans strain TK24/pSgs4 with the accession number DSM 12998. 

5. The recombinant DNA according to claim 3, which is the plasmid pSgc5 deposited in E, 
coli strain XLlBlueMRFVpSgcS with the accession number DSM 12999. 

20 

6. Use of the genes derived from the DNA fragment of claim 1 or 2 in the production of 
anthracycline metabolites. 

7. Use of the genes derived from the DNA fragment of claim 1 or 2 to increase aclacino- 
25 mycin production. 

8. Use according to claim 6 or 7, wherein the genes are encoding an activator, a 
dehydratase, an oxidoreductase, a dTDP-glucose 4,6-dehydratase, a glycosyl transferase, an 
isomerase, an aklaviketone reductase, a polyketide assembler, a cyclase, an aminomethylase, 

30 a glucose- 1 -phosphate thymidylyl transferase, and an aminotransferase. 
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9. A process for increasing aclacinomycin production in a bacterial host, comprising 
transferring the DNA fragment of claim 1 or 2 into a Streptomyces host, cultivating the 
recombinant strain obtained, and isolating the aclacinomycins produced. 

10. The process according to claim 9, wherein the Streptomyces host is a Streptomyces 
galilaeus host. 

1 1. The process according to claim 10, wherein the Streptomyces galilaeus host is a mutant 
strain derived from S, galilaeus ATCC 31615. 

12. A process for producing metabolites, comprising transferring the DNA fragment of 
claim 1 or 2 mto a Streptomyces host, cultivating the recombinant strain obtained, and 
isolating the compounds produced. 



13. A process for producing anthracycline metabolites, comprising transferring the DNA 
fragment according to claim 1 or 2 into a Streptomyces peucetius host, cultivating the 
recombinant strain obtained, and isolating the compounds produced. 
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SEQUENCE LISTING 

<110> Galilaeus Oy 

<120> The gene cluster involved in aclacinomycin 

biosynthesis, and its use for genetic engineering 

<130> 33318 

<140> 
<141> 

<160> 16 

<170> Patentin Ver. 2.2 

<210> 1 
<211> 662 
<212> PRT 

<213> Streptornyces ga^lilaeus 
<400> 1 

Met Thr Glu Asp Arg Val Thr Thr Leu Gly Gly Glu Gin lie Ala Leu 
15 10 15 

Leu Ala Pro Leu Leu Asp Gly Ser Arg Asp Leu Pro Gly lie Val Ala 
20 25 30 

Asp Ala Ala Pro Arg Leu Pro Ala Gly Leu Ala Glu Arg Leu Val Thr 
35 40 45 

Arg Leu Leu Asp Ala Gly Leu Leu Cys Ala Tyr Pro Gin Asp Gly Ala 
50 55 60 

Asp Arg Pro Glu Arg Ala Tyr Arg Ser Leu Thr Gly Leu Gin Ala Arg 
65 70 75 80 

Ser Ala Asp Ala Arg Asp Ala Val Leu Ala Ala Val Asp Leu Thr Gly 
85 90 95 

Asp Ala Glu Ser Pro Leu Pro Glu Ala Val Ser Ala Ala Gly Leu Arg 
100 105 110 

Ala Ala Ala Pro Gly Glu His Ala Ala Leu Thr Leu Val Leu Cys His 
115 120 125 

Asp Tyr Leu Asp Pro Arg Leu Ser Ala Leu Asp Ala Glu His Arg Ala 
130 135 140 

Thr Gly Arg Gly Trp Leu Pro Val Arg Ala Asn Gly Thr His Leu Trp 
145 150 155 160 

lie Gly Pro Phe Phe Ser Ala Gly Asp Gly Pro Cys Trp Ser Cys Leu 
165 170 175 



Ala Asp Arg Leu Arg Leu Arg Arg Arg Gly Glu Ala Tyr Val Gin His 
180 185 190 



wo 01/23578 



PCT/FIOO/00819 



Arg Leu Gly His Ser Gly Pro Ala Val His Arg Arg Ala Tyr Leu Pro 
195 200 205 

Ala Gly Arg Ala Ala Ala Leu Gin Leu Ala Leu Leu Glu Ala Gly Lys 
210 215 220 

Trp Leu Ser Gly His Arg Asp Thr Val Gin Asp Ser Leu Trp Arg Leu 
225 230 235 240 

Asp Thr Arg Thr Leu Glu Ser Ser Arg His Pro Val Arg Arg Arg Pro 
245 250 255 

Gin Cys Ser Arg Cys Gly Asp Pro Leu Leu Val Arg Asp Arg Val Ser 
260 265 270 

Ala Pro Val Val Leu Ser Ser Arg Pro Val Arg Asp Glu Ser Gly Gly 
275 280 285 

Gly His Arg Thr Phe Gly Pro Gin Glu Met Leu Asp Arg Tyr Gly His 
290 295 300 

Leu Val Asp Pro Val Thr Gly Val Val Gly Glu lie Arg Arg Asp Pro 
305 310 315 320 

Arg Gly Pro Glu Phe Leu Asn Cys Phe Thr Arg Ser Arg Cys Arg Leu 
325 330 335 

Gly Pro Arg Ala Ala Pro Pro Ala Leu His Ser Pro Leu Arg Ser Pro 
340 345 350 

Gly Ser Gly Lys Gly Val Thr Glu Leu His Ala Arg Val Ser Ala Leu 
355 360 365 

Ala Glu Ala Leu Glu Arg Cys Ser Gly Tyr Phe Gin Gly Asp Glu Pro 
370 375 380 

Arg Arg Arg Gly Ser Tyr Arg Glu Leu Ala Gly Leu Ala Val His Pro 
385 390 395 400 

Asp Ser Val Gin Leu Phe Asp Arg Arg Gin Phe Glu Asp Arg Arg Ala 
405 410 415 

Trp Asn Arg Ala His Gly Pro Phe His Gin Val Thr Glu Pro Phe Asp 
420 425 430 

Glu Asp Ala Pro lie Asp Trp Thr Pro Val Trp Ser Leu Thr Glu Arg 
435 440 445 

Arg Gin Arg Leu Ala Pro Thr Ser Leu Leu Tyr Tyr Asn Ala Pro Asp 
450 455 460 

Ala Asp Thr Gly Phe Cys Arg Ala Thr Ser Asn Gly Ala Ala Ala Gly 
465 470 475 480 

Thr Ser Leu Glu Asp Ala Val Val His Gly Cys Leu Glu Leu Val Glu 
485 490 495 
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Arg Asp Ala lie Ala Leu Trp Trp Tyr Asn Arg Thr Arg Gin Pro Gly 
500 505 510 

Val Thr Leu Asp Ala Arg Asp Pro Trp lie Thr Arg Leu Arg Ala Val 
515 520 525 

Leu Arg Asp Leu Gly Arg Thr Val Trp Ala Leu Asp Leu Thr Ser Asp 
530 535 540 

Leu Gly He Pro Val Val Ala Ala Val Ser Val Arg Thr Gly Gly Thr 
545 550 555 560 

Ala Glu Asp He Val Leu Gly Phe Gly Ala His Phe Asp Pro Arg He 
565 570 575 

Ala Leu Arg Arg Ala Leu Thr Glu Leu Ser Gin Met Leu Pro Pro Leu 
580 585 590 

Ala Gin Glu Thr Ala Gly Asp Ala Ser Ala Tyr Thr Gly Thr Asp Pro 
595 600 605 

Glu Ala Mec Arg Trp Phe Arg His Ala Thr Thr Ala Asn Gin Pro Tyr 
610 615 620 

Leu Leu Pro Ala Ala Arg Arg Ser Ala Arg Pro Pro Ala Ser Leu Arg 
625 630 635 640 

Pro Pro Arg Asp Ala Ala Ala Gin Ala Gly Ala Leu Val Ala Leu Leu 
645 650 655 

Arg Arg His Gly Leu Glu 
660 



<210> 2 
<211> 272 
<212> PRT 

<213> Streptowyces galilaeus 
<400> 2 

Val Asp He Trp Leu Leu Gly Pro Leu Thr Ala Glu Val Arg Gly Arg 
15 10 15 

Ser He Val Pro Thr Ala Ala Lys Pro Arg Gin He Leu Ala Leu Leu 
20 25 30 

Ala He His Ala Asn Arg Val Leu Pro Val Gly Thr Leu Met Glu Glu 
35 40 45 

He Trp Gly Thr Glu Pro Pro Gin Ser Ala Leu Ala Thr Leu His Thr 
50 55 60 

Tyr He Leu Gin Leu Arg Arg Arg Leu Thr Ala Ala Tyr Gly Asp Glu 
65 70 75 80 



Gly Gly Val Ser Ala Lys Asp Val Leu Val Thr Gin Tyr Gly Gly Tyr 
85 90 95 
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Cys Trp Gin Ala 
100 

Leu Val Thr Ala 
115 

Ala Ser Ala His 
130 

Leu Val Asp Val 
145 

Leu Glu Glu Ser 



Leu Arg Leu Gly 
180 

Thr Gly Arg His 
195 

Ala Leu Tyr Arg 
210 

Arg Leu Arg Arg 
225 

Arg Leu Gin Arg 



Asp Ala Pro Arg 
260 



Pro Thr Asp Ser 



Gly Arg lie Ala 
120 

Phe Arg Glu Ala 
135 

Arg lie Gly Pro 
150 

Arg Leu Gly Val 
165 

Arg His Ala Glu 



Pro Leu His Glu 
200 

Ala Gly Arg Ser 
215 

Arg Leu Ala Glu 
230 

Leu Gin Gin Ala 
245 

Tyr Gly Gly Asp 



4 

Val Asp Val Pro 
105 

Thr Ala Glu Asp 



Leu Ala Leu Trp 
140 

Val Leu Ser He 
155 

Leu Glu Arg Cys 
170 

Leu Leu Ala Glu 
185 

Gly Leu His Ala 



Trp Gin Ala Leu 
220 

Glu Leu Gly Leu 
235 

Val Leu Ser Ala 
250 

Pro Val Phe Asp 
265 



Arg Tyr Glu Arg 
110 

Arg Gin Glu Glu 
125 

Arg Gly Ser Ala 



Glu Val Ala Arg 
160 

Leu Glu Ala Asp 
175 

Leu Thr Glu Leu 
190 

Gin Cys Met Thr 
205 

Asp Val Tyr Gin 



Ser Pro Ser Pro 
240 

Glu Pro Trp Leu 
255 

Arg Met He Ser 
270 



<210> 3 
<211> 434 
<212> PRT 

<213> Streptomyces galllaeus 
<400> 3 

Met Thr Ser Asp Thr Lys Ala Leu Val Leu Glu Gin Val Arg Glu Tyr 
15 10 15 

His Arg Gin Gin Gin Pro Gly Asn Phe Gin Pro Gly Val Thr Pro He 
20 25 30 

Leu Ser Ser Gly Ala Val Leu Asp Glu Glu Asp Arg Val Ala Leu Val 
35 40 45 

Glu Ala Ala Leu Asp Leu Arg He Ala Ala Gly Ala His Ser Arg Arg 
50 55 60 

Phe Glu Ser Lys Phe Ala Arg His He Gly Val Arg Lys Ala His Leu 
65 70 75 80 



Val Asn Ser Gly Ser Ser Ala Asn Leu Leu Ala Leu Ser Ala Leu Thr 
85 90 95 
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Ser Pro Arg Leu Gly Glu Gin Arg Leu Arg Pro Gly Asp Glu Val He 
100 105 110 

Thr Val Ala Gly Gly Phe Pro Thr Thr Val Asn Pro He Leu Gin Asn 
115 120 125 

Gly Leu Thr Pro Val Phe Val Asp Leu Glu Leu Gly Thr Tyr Asn Thr 
130 135 140 

Thr Val Glu His Val Arg Ala Ala He Ser Asp Arg Thr Arg Ala He 
145 150 155 160 

Men He Ala His Thr Leu Gly Asn Pro Tyr Gin Val Ala Glu He Gin 
165 170 175 

Gin Leu Ala Thr Glu His Glu Leu Phe Leu He Glu Asp Asn Cys Asp 
180 185 190 

Ala Val Gly Ser Thr Tyr Gin Gly Arg Met Thr Gly Thr Phe Gly Asp 
195 200 205 

Leu Ala Thr Val Ser Phe Tyr Pro Ala His His He Thr Thr Gly Glu 
210 215 220 

Gly Gly Cys Val Leu Thr Arg Asn Leu Glu Leu Ala Arg He Val Glu 
225 230 235 240 

Ser Phe Arg Asp Trp Gly Arg Asp Cys Trp Cys Glu Pro Gly Glu Asp 
245 250 255 

Asn Thr Cys Leu Lys Arg Phe Asp Tyr Gin Leu Gly Asn Leu Pro Lys 
260 265 270 

Gly Tyr Asp His Lys Tyr He Phe Ser His He Gly Tyr Asn Leu Lys 
275 280 285 

Ala Thr Asp Leu Gin Gly Ala Leu Ala Leu Ser Gin Leu Asn Lys Leu 
290 295 300 

Pro Glu Phe Gly Ala Ala Arg Arg Arg Asn Trp Gin Arg Leu Arg Asp 
305 310 315 320 

Gly Leu Ala Asp Val Pro Gly Leu Leu Leu Pro Val Ala Thr Pro Gly 
325 330 335 

Ser Asp Pro Ser Trp Phe Gly Phe Val He Thr Val Leu Pro Asp Ala 
340 345 350 

Thr Tyr Thr Arg Arg Asp Leu Val Ala Phe Leu Glu Glu Arg Arg He 
355 360 365 

Gly Thr Arg Arg Leu Phe Gly Gly Asn Leu Thr Arg His Pro Ala Tyr 
370 375 380 



Leu Gly Thr Pro His Arg Val Ala Gly Asp Leu Arg Asn Ser Asp He 
385 390 395 400 
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He Thr Glu Gin Ser Phe Trp He Gly Val Tyr Pro Gly He Thr Glu 
405 410 415 

Glu Met Thr Asp Tyr Met Arg Glu Ser He Val Glu Phe Val Thr Lys 
420 425 430 

Asn Gly 



<210> 4 
<21I> 329 
<212> PRT 

<213> Strep tomyces galilaeus 
<400> 4 

Met Pro Lys Asp Thr Pro Arg Pro Val Leu Arg He Gly Val Leu Gly 
15 10 15 

Cys Ala Asp He Ala Val Arg Arg He Leu Pro Ala He Val Glu His 
20 25 30 

Pro Ser Val Arg Leu Val Ala Leu Ala Ser Arg Asp Gly Ala Arg Ala 
35 40 45 

Glu Arg Leu Ala Ala Arg Phe Gly Cys Ala Ala Val Thr Gly Tyr Lys 
50 55 60 

Ala Leu Leu Asp Arg Glu Asp He Asn Ala Val Tyr Val Pro Leu Pro 
65 70 75 80 

Pro Gly Met His His Glu Trp Val Thr Glu Ala Leu Thr Ala Gly Lys 
85 90 95 

His Val Leu Val Glu Lys Pro Leu Ser Thr Thr Tyr Ala Gin Ser Val 
100 105 110 

Asp Leu Val Ala Met Ala Gly Arg Leu Gly Leu Ala Leu Thr Glu Asn 
115 120 125 

Phe Met Phe Leu His His Ser Gin His Glu Ala Val Arg Ala Met Thr 
130 135 140 

Gly Glu He Gly Glu Leu Arg Val Phe Thr Ser Ser Phe Gly Val Pro 
145 150 155 160 

Pro Pro His Pro Ser Ser Phe Arg His Asp Ala Arg Leu Gly Gly Gly 
165 170 175 

Ala Leu Leu Asp Val Gly Val Tyr Pro Leu Arg Ala Ala Gin Leu His 
180 185 190 

Leu Ala Gly Glu Leu Asp Val Leu Gly Ala Cys Leu Arg Val Asp Glu 
195 200 205 



Ala Thr Gly Val Asp Val Ala Gly Ser Ala Leu Leu Ser Thr Ala Thr 
210 215 220 
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Gly Val Thr Ala Gin Leu Asp Phe Gly Phe Gin His Ala Tyr Arg Ser 
225 230 235 240 

Val Tyr Ala Leu Trp Gly Ser Arg Gly Arg Leu Ser Val Pro Arg Ala 
245 250 255 

Phe Thr Pro Pro Arg Glu His Arg Pro Val Val Arg lie Glu Gin Gin 
260 265 270 

Asp Arg Leu Thr Glu Val Thr Leu Pro Ala Asp His Gin Val Gly Asn 
275 280 285 

Ala Leu Asp Ala Phe Ala Ser Ala Val His Ser Glu Thr Val Arg Ala 
290 295 300 

Ser Leu Gly Glu Ala Leu Leu Arg Gin Ala Leu Leu Val Glu Gin Val 
305 310 315 320 

Arg Lys Ala Ala Arg Val Val Ser Gly 
325 



<210> 5 
<211> 323 
<212> PRT 

<213> Streptornyces galllaeus 



<400> 5 

Met Arg Val Leu 

1 

Val Arg Ser Leu 
20 

Val Thr Val Val 
35 

Pro Leu Ala Asp 
50 

Leu Asp Ala Leu 
65 

Phe Ala Ala Glu 



Phe Val Arg Thr 
100 

Leu Arg Gly Gly 
115 

Tyr Gly Ser lie 
130 

Pro Asn Ser Pro 
145 



He Thr Gly Gly 
5 

Leu Ala Gly Thr 



Asp Leu Leu Thr 
40 

Pro Arg Leu Asp 
55 

Leu Thr Val Val 
70 

Thr His Val Asp 
85 

Asn Val Leu Gly 



Val Gly Thr Phe 
120 

Ala Gin Gly Thr 
135 

Tyr Ala Ala Ser 
150 



Ala Gly Phe lie 
10 

Leu Pro Gly Pro 
25 

Tyr Ala Gly Asp 



Phe Arg Arg Leu 
60 

Pro Gly His Asp 
75 

Arg Ser Leu Ser 
90 

Thr Gin Ser Leu 
105 

Val His Val Ser 



Trp Thr Glu Glu 
140 

Lys Ala Gly Ser 
155 



Gly Ser His Tyr 
15 

Arg Pro Ser Arg 
30 

Thr Gly Asn Leu 
45 

Asp He Arg Asp 



Ala Val Val His 
80 

Glu Pro Ala Glu 
95 

Leu Glu Ala Ser 
110 

Thr Asp Glu Val 
125 

Ala Pro Leu Leu 



Asp Leu Val Ala 
160 



wo 01/23578 



PCT/FIOO/00819 



Arg Ser Tyr Trp 



Ala Asn Asn Tyr 
180 

Phe Val Thr Glu 
195 

Gly Gly Asn Val 
210 

Val His Ala Val 
225 

Gly Gly Gly Thr 



Ala Leu Cys Gly 
260 

Lys Gly His Asp 
275 

Glu Leu Gly Tyr 
290 

Val Val Asp Trp 
305 

Glu Arg Val 



Arg Thr His Gly 
165 

Gly Pro Arg Gin 



Leu Leu Ala Gly 
200 

Arg Glu Trp Leu 
215 

Leu Thr Gly Gly 
230 

His Leu Thr Asn 
245 

Thr Asp Trp Ser 



Leu Arg Tyr Ala 
280 

Arg Pro Leu Arg 
295 

Tyr Arg Asp Arg 
310 



8 

Leu Asp Val Arg 
170 

His Pro Glu Lys 
185 

Arg Pro Val Pro 



His Val Asp Asp 
220 

Arg Pro Gly Glu 
235 

Arg Glu Met Thr 
250 

Arg Val Arg Gin 
265 

Val Asp Asp Thr 



Ser Leu Asp Asp 
300 

Gin Thr His Arg 

315 



Thr Thr Arg Cys 
175 

Leu lie Pro Leu 
190 

Leu T^^r Gly Asp 
205 

His Cys Arg Ala 



lie Tyr Asn lie 
240 

Ala Lys Leu Leu 
255 

Val Pro Asp Arg 
270 

Lys lie Arg Glu 
285 

Gly Leu Arg Glu 



Pro Glu Pro Ala 
320 



<210> 6 
<211> 443 
<212> PRT 

<213> Streptomyces galilaeus 
<400> 6 

Met Arg Val Leu Leu Thr Ser Phe Ala Leu Asp Ala His Phe Asn Gly 
15 10 15 

Ser Val Pro Leu Ala Trp Ala Leu Arg Ala Ala Gly His Glu Val Arg 
20 25 30 

Val Ala Ser Gin Pro Ala Leu Thr Ala Ser lie Thr Ala Ala Gly Leu 
35 40 45 

Thr Ala Val Pro Val Gly Ala Asp Pro Arg Leu Asp Glu Met Val Lys 
50 55 60 

Gly Val Gly Asp Ala Val Leu Ser His His Ala Asp Gin Ser Leu Asp 
65 70 75 80 



Ala Asp Thr Pro Gly Gin Leu Thr Pro Ala Phe Leu Gin Gly Trp Asp 
85 90 95 
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Thr Met Met Thr Ala Thr Phe Tyr Thr Leu lie Asn Asp Asp Pro Met 
100 105 110 

Val Asp Asp Leu Val Ala Phe Ala Arg Gly Trp Glu Pro Asp Leu lie 
115 120 125 

Leu Trp Glu Pro Phe Thr Phe Ala Gly Ala Val Ala Ala Lys Val Thr 
130 135 140 

Gly Ala Ala His Ala Arg Leu Leu Ser Phe Pro Asp Leu Phe Met Ser 
145 150 155 160 

Met Arg Arg Ala Tyr Leu Ala Gin Leu Gly Ala Ala Pro Ala Gly Pro 
165 170 175 

Ala Gly Gly Asn Gly Thr Thr His Pro Asp Asp Ser Leu Gly Gin Trp 
180 185 190 

Leu Glu Trp Thr Leu Gly Arg Tyr Gly Val Pro Phe Asp Glu Glu Ala 
195 200 205 

Val Thr Gly Gin Trp Ser Val Asp Gin Val Pro Arg Ser Phe Arg Pro 
210 215 220 

Pro Ser Asp Arg Pro Val Val Gly Met Arg Tyr Val Pro Tyr Asn Gly 
225 230 235 240 

Pro Gly Pro Ala Val Val Pro Asp Trp Leu Arg Val Pro Pro Thr Arg 
245 250 255 

Pro Arg Val Cys Val Thr Leu Gly Met Thr Ala Arg Thr Ser Glu Phe 
260 265 270 

Pro Asn Ala Val Pro Val Asp Leu Val Leu Lys Ala Val Glu Gly Leu 
275 280 285 

Asp He Glu Val Val Ala Thr Leu Asp Ala Glu Glu Arg Ala Leu Leu 
290 295 300 

Thr His Val Pro Asp Asn Val Arg Leu Val Asp His Val Pro Leu His 
305 310 315 320 

Ala Leu Leu Pro Thr Cys Ala Ala He Val His His Gly Gly Ala Gly 
325 330 335 

Thr Trp Ser Thr Ala Leu Val Glu Gly Val Pro Gin He Ala Met Gly 
340 345 350 

Trp He Trp Asp Ala He Asp Arg Ala Gin Arg Gin Gin Ala Leu Gly 
355 360 365 

Ala Gly Leu His Leu Pro Ser His Glu Val Thr Val Glu Gly Leu Arg 
370 375 380 

Gly Arg Leu Val Arg Leu Leu Asp Glu Pro Ser Phe Thr Ala Ala Ala 
385 390 395 400 
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Ala Arg Leu Arg Ala Glu Ala Glu Ser Glu Pro Thr Pro Ala Gin Val 
405 410 415 

Val Pro Val Leu Glu Arg Leu Thr Ala Gin His Arg Ala Arg Glu Pro 
420 425 430 

Arg Arg Pro Gly Gly Thr Ser Pro Cys Val Ser 
435 440 



<210> 7 
<211> 443 
<212> PRT 

<213> Streptomyces galllaeus 
<400> 7 

Val Gin Thr Gin Asn Ala Pro Glu Thr Ala Glu Asn Gin Gin Thr Asp 
15 10 15 

Ser Glu Leu Gly Arg His Leu Leu Thr Ala Arg Gly Phe His Trp lie 
20 25 30 

Tyr Gly Thr Ser Gly Asp Pro Tyr Ala Leu Thr Leu Arg Ala Glu Ser 
35 40 45 

Asp Asp Pro Ala Leu Leu Thr Arg Arg lie Arg Glu Ala Gly Thr Pro 
50 55 60 

Leu Trp Gin Ser Thr Thr Gly Ala Trp Val Thr Gly Arg His Gly Val 
65 70 75 80 

Ala Ala Glu Ala Leu Ala Asp Pro Arg Leu Ala Leu Arg His Ala Asp 
85 90 95 

Leu Pro Gly Pro Gin Arg His Val Phe Ser Asp Ala Trp Ser Asn Pro 
100 105 110 

Gin Leu Cys His lie lie Pro Leu Asp Arg Ala Phe Leu His Ala Ser 
115 120 125 

Asp Ala Asp His Thr Arg Trp Ala Arg Ser Ala Ser Ala Val Leu Gly 
130 135 140 

Ser Ala Gly Gly Ala Pro Ala Glu Gly Val Arg Glu His Ala Gly Arg 
145 150 155 160 

Val His Arg Glu Ala Ala Asp Arg Thr Gly Asp Ser Phe Asp Leu Met 
165 170 175 

Ala Asp Tyr Ser Arg Pro Val Ala Thr Glu Ala Ala Ala Glu Leu Leu 
180 185 190 

Gly Val Pro Ala Ala Gin Arg Glu Arg Phe Ala Ala Thr Cys Leu Ala 
195 200 205 



Leu Gly Val Ala Leu Asp Ala Ala Leu Cys Pro Gin Pro Leu Ala Val 
210 215 220 
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Thr Arg Arg Leu Thr Glu Ala Val Glu Asp Val Arg Ala Leu Val Gly 
225 230 235 240 

Asp Leu Val Glu Ala Arg Arg Thr Gin Pro Gly Asp Asp Leu Leu Ser 
245 250 255 

Ala Val Leu His Ala Gly Ser Ser Ala Ala Ser Ala Gly Gin Asp Ala 
260 265 270 

Leu Ala Val Gly Val Leu Thr Ala Val Val Gly Val Glu Val Thr Ala 
275 280 285 

Gly Leu lie Asn Asn Thr Leu Glu Ser Leu Leu Thr Arg Pro Val Gin 
290 295 300 

Trp Ala Arg Leu Gly Glu Asn Pro Glu Leu Ala Ala Gly Ala Val Glu 
305 310 315 320 

Glu Ala Leu Arg Phe Ala Pro Pro Val Arg Leu Glu Ser Arg lie Ala 
325 330 335 

Ala Glu Asp Leu Thr Leu Gly Gly Gin Asp Leu Pro Ala Gly Ala Gin 
340 345 350 

Val Val Val His Val Gly Ala Ala Asn Arg Asp Pro Glu Ala Phe Leu 
355 360 365 

Ala Pro Asp His Phe Asp Leu Asp Arg Pro Ala Gly Gin Gly Gin Leu 
370 375 380 

Ser Leu Ser Gly Pro His Thr Ala Leu Phe Gly Ala Phe Ala Arg Leu 
385 390 395 400 

Gin Ala Glu Thr Ala Val Arg Thr Leu Arg Glu Arg Arg Pro Val Leu 
405 410 415 

Ala Pro Ala Gly Ala Val Leu Arg Arg Met Arg Ser Pro Val Leu Gly 
420 425 430 

Ala Val Leu Arg Phe Pro Leu Thr Thr Ser Ala 
435 440 



<210> 8 
<211> 267 
<212> PRT 

<213> Streptomyces ga.llla.eus 
<400> 8 

Val Asn Arg Ala Ala Arg Pro Thr Val Arg Gly Met Ser Ala lie Ala 
15 10 15 

Glu Pro Thr Ala Pro Arg Gly Val lie Val Thr Gly Gly Gly Thr Gly 
20 25 30 



lie Gly Arg Ala Thr Ala His Ala Phe Ala Asp Arg Gly Asp Arg Val 
35 40 45 
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Leu Val Val Gly Arg Thr Ala Ala Thr Leu Ala Gly Thr Ala Glu Gly 
50 55 60 

His Pro Gly He Ser Val Leu Thr Ala Asp Leu Thr Asp Pro Asp Gly 
65 70 75 80 

Pro Arg Ala He Thr Asp Ala Ala Leu Asp Ala Leu Gly Arg He Asp 
85 90 " 95 ^ 

Val Leu Val Asn Asn Ala Ala Thr Gly Gly Phe Ala Gly Leu Ala Glu 
100 105 110 

Thr Glu Pro Glu Ala Ala Arg Glu Gin Phe Asp Ser Asn Leu Leu Ala 
115 120 125 

Pro Leu Leu Leu Thr Arg Gin Thr Leu Asp Ala Leu Ser Ala Asp Gly 
130 135 140 

Gly Gly Thr Val Leu Asn He Gly Ser Ala Gly Ala Leu Gly Arg Arg 
145 150 155 160 

Ala Trp Pro Gin Asn Gly Val Tyr Gly Ala Ala Lys Ala Gly Leu Asp 
165 170 175 

Phe Leu Thr Arg Thr Trp Ala Val Glu Leu Ala Pro Arg Gly He Arg 
180 185 190 

Val Leu Gly Leu Ala Pro Gly Val He Asp Thr Gly He Gly Glu Arg 
195 200 205 

Ser Gly Met Ser Arg Glu Ala Tyr Ala Gly Phe Leu Gly Gin He Ala 
210 215 220 

Ala Arg Val Pro Ala Gly Arg Val Gly Arg Pro Glu Asp He Ala Trp 
225 230 235 240 

Trp Ala Val Gin Leu Ala Asp Pro Arg Ala Ala Tyr Ala Thr Gly Ala 
245 250 255 

Val Leu Ala Val Asp Gly Gly Leu Ser Leu Thr 
260 265 



<210> 9 
<211> 144 
<212> PRT 

<213> Streptomyces galilaeus 
<400> 9 

Met Thr Ala Gin Ala Pro Thr Ala Pro Ala Asp Val Tyr Ala Glu Val 
15 10 15 

Gin His Phe Tyr Ala Arg Gin Met Arg Tyr Leu Asp Ser Gly Glu Ala 
20 25 30 



Glu Thr Trp Ala Gly Thr Phe Thr Glu Asp Gly Ser Phe Ala Pro Pro 
35 40 45 
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Ser Leu Pro Glu Pro Val Arg Gly Arg Pro Leu Leu Ala Glu Gly Ala 

50 55 60 

Arg Asn Ala Ala Ala Gly Leu Ala Ala Ala Arg Glu Thr His Arg His 

65 70 75 80 

Trp Val Gly Met Leu Thr Val Thr Pro Ala Asp Asp Gly Ser Leu Thr 

85 90 95 

Ala Glu Ser Leu Val Ser He Val Ala Val Ala Gin Gly Gly Pro Ala 

100 105 110 

Arg Leu His Leu Val Cys Thr Cys Arg Asp Val Leu Val Arg Glu Gly 

115 120 125 

Gly Arg Leu Leu Val Arg Glu Arg Val Val Thr Arg Asp Asp Arg Pro 

130 135 140 



<210> 10 
<211> 259 
<212> PRT 

<213> Streptomyces galila^eus 
<400> 10 

Val Arg He He Asp Leu Ser Ser Pro Val Asp Ala Ala Gly Phe Glu 
15 10 15 

Pro Asp Pro Val Val His Asp Val Leu Gly Pro Lys Glu Ala Ala Thr 
20 25 30 

His Met Ser Glu Glu Met Arg Glu His Phe Gly He Asp Phe Asp Pro 
35 40 45 

Ala Glu Leu Pro Glu Gly Glu Phe Leu Ser Leu Asp Arg Leu Gin Leu 
50 55 60 

Thr Thr His Thr Gly Thr His Val Asp Ala Pro Ser His Tyr Gly Thr 
65 70 75 80 

Arg Ala Ala Tyr Arg Asp Gly Pro Pro Arg His He Asp Glu Met Pro 
85 90 95 

Leu Asp Trp Phe Phe Arg Pro Ala Val Val Leu Asp Leu Ser Asp Gin 
100 105 110 

Gly Thr Gly Ala Val Gly Ala Asp Val Leu Arg Arg Glu Met Asp Arg 
115 120 125 

He Gly His Thr Pro Ser Pro Met Asp He Val Leu Leu Arg Thr Gly 
130 135 140 

Ala Asp Ala Trp Ala Gly Thr Pro Lys Tyr Phe Thr Asp Phe Thr Gly 
145 150 155 160 



Leu Asp Gly Ser Ala Val His Leu Leu Leu Asp Leu Gly Val Arg Val 
165 170 175 
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lie Gly Thr Asp Ala Phe Ser Leu Asp Ala Pro Phe Gly Asp lie He 
180 185 190 

Thr Arg Tyr Arg Ala Thr Gly Asp Pro Ser Val Leu Trp Pro Ala His 
195 200 205 

Val He Gly Arg Asp Arg Glu Tyr Cys Gin Val Glu Arg Leu Ala Gly 
210 215 220 

Leu Asp Arg Leu Pro Ala Ala His Gly Phe Arg Val Ala Cys Phe Pro 
225 230 235 240 

Val Arg He Ala Gly Ala Gly Ala Gly Trp Thr Arg Ala Val Ala Leu 
245 250 255 



Val Asp Glu 



<210> 11 
<211> 238 
<212> PRT 

<213> Strepcomyces galllaeus 
<400> 11 

Met Tyr Gly Arg Glu Leu Ala Asp Val Tyr Glu Ala He Tyr Arg Ser 
15 10 15 

Arg Gly Lys Asp Trp Gly Gin Glu Ala Ala Asp Val Ser Arg He He 
20 25 30 

Thr Glu Arg Arg Pro Gly Ala Gly Ser Leu Leu Asp Val Ala Cys Gly 
35 40 45 

Thr Gly Ala His Leu Ser Val Phe Ser Thr Leu Phe Glu Val Ala Glu 
50 55 60 

Gly Leu Glu He Ala Glu Pro Met Arg Arg Leu Ala Glu Gin Arg Leu 
65 70 75 80 

Pro Gly Thr Thr Val His Ala Gly Asp Met Arg Asp Phe Arg Leu Pro 
85 90 95 

Arg Thr Tyr Asp Ala Val Ser Cys Met Phe Cys Ala He Gly Tyr Leu 
100 105 110 

Glu Thr Leu Asp Asp Met Arg Ala Ala Val Arg Ser Met Ala Ala His 
115 120 125 

Leu Glu Pro Gly Gly Val Leu Val Val Glu Pro Trp Trp Phe Pro Glu 
130 135 140 

Asn Phe He Glu Gly Tyr Val Ala Gly Asp Leu Ala Arg Glu Glu His 
145 150 155 160 



Arg Thr He Ala Arg He Ser His Thr Thr Arg Lys Gly Arg Ala Thr 
165 170 175 
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Arg Met Glu Val 
180 

Phe Thr Glu lie 
195 

Ala Ala Phe Thr 
210 

Pro Thr Gly Arg 
225 



Arg Phe Thr Val 



Asp Val Leu Thr 
200 

Asp Ala Gly Cys 
215 

Gly Leu Phe Val 
230 



15 

Gly Asp Ala Ala 
185 

Leu Phe Thr Arg 



Ser Val Glu Phe 
220 

Gly Val Arg Glu 
235 



Gly lie Gin Gin 
190 

Asp Glu Tyr Thr 
205 

Leu Glu Asp Gly 



Gin Arg 



<210> 12 
<211> 291 
<212> PRT 

<213> Screptomyces galllaeus 
<400> 12 

Met Lys Gly lie lie Leu Ala Gly Gly Ser Gly Thr Arg Leu His Pro 
15 10 15 

lie Thr Val Ser Val Ser Lys Gin Leu Leu Pro Val Gly Asp Lys Pro 
20 25 30 

Met lie Tyr Tyr Pro Leu Ser Val Leu Met Leu Ala Asp lie Arg Glu 
35 40 45 

lie Leu Leu lie Cys Thr Glu Arg Asp Leu Glu Gin Phe Arg Arg Leu 
50 55 60 

Leu Gly Asp Gly Ser Gin Leu Gly Leu Arg lie Asp Tyr Ala Val Gin 
65 70 75 80 

Asn Arg Pro Ala Gly Leu Ala Asp Ala Phe Val lie Gly Ala Asp His 
85 90 95 

Val Gly Asp Asp Asp Val Ala Leu Val Leu Gly Asp Asn He Phe His 
100 105 110 

Gly His His Phe Tyr Asp Leu Leu Gin Ser Asn Val His Asp Val Gin 
115 120 125 

Gly Cys Val Leu Phe Gly Tyr Pro Val Glu Asp Pro Glu Arg Tyr Gly 
130 135 140 

Val Gly Glu Thr Asp Ala Ser Gly Gin Leu Val Ser Leu Glu Glu Lys 
145 150 155 160 

Pro Leu Arg Pro Arg Ser Asp Leu Ala He Thr Gly Leu Tyr Leu Tyr 
165 170 175 

Asp Asn Glu Val Val Asp lie Ala Lys Asn Leu Arg Pro Ser Pro Arg 
180 185 190 



Gly Glu Leu Glu He Thr Asp Val Asn Arg Asn Tyr Leu Ala Arg Gly 
195 200 205 
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Arg Ala Arg Leu 
210 

Gly Thr Pro Glu 
225 

Glu Glu Arg Gin 



Arg Met Gly Phe 
260 

Met Ser Gin Ser 
275 



Val Asp Leu Gly 
215 

Ser Leu Leu Gin 
230 

Gly Val Arg lie 
245 

lie Asp Ala Asp 



Gly Tyr Gly Arg 
280 



Arg Gly Phe Ala 
220 

Ala Thr Gin Tyr 
235 

Ala Cys Val Glu 
250 

Met Cys His Arg 
265 

Tyr Val Met Ala 



Trp Leu Asp Ala 



Val Arg Thr Leu 
240 

Glu Val Ala Leu 
255 

Leu Gly Glu Gin 
270 

Val Ala Arg Glu 
285 



Phe Ser Gly 
290 



<210> 13 

<211> 341 

<212> PRT 

<213> Streptomyces 



galllaeus 



<400> 13 
Met Thr Thr Leu 
1 

Ala Asp lie Leu 
20 

Val Leu Gly Asp 
35 

His Gly Ala Ala 
50 

Lys Leu Ala Leu 
65 

Thr Val Ser Asn 



Gly Ala Thr Pro 
100 

Asp Thr Glu Gin 
115 



Val Trp Asp Tyr 
5 

Asp Ala Val Glu 



Ser Val Arg Gly 
40 

His Cys Val Gly 
55 

Gin Ala Leu Gly 
70 

Thr Ala Ala Pro 
85 

Val Phe Val Asp 



Val Glu Ala Ala 
120 



Leu Gin Glu Tyr 
10 

Thr Val Phe Ser 
25 

Phe Glu Glu Glu 



Val Asp Asn Gly 
60 

Val Gly Pro Gly 
75 

Thr Val Val Ala 
90 

Val His Pro Asp 
105 

Leu Thr Pro Arg 



Glu Asn Glu Arg 
15 

Ser Gly Arg Leu 
30 

Phe Ala Ala Tyr 
45 

Thr Asn Ala lie 



Asp Glu Val Val 
80 

lie Asp Ser Val 
95 

Ser Tyr Leu Met 
110 

Thr Arg Cys Leu 
125 



Leu Pro Val His Leu Tyr Gly Gin Cys Val Asp Leu Ala Pro Leu Glu 

130 135 140 

Arg Leu Ala Ala Glu His Asp Leu Phe Leu Val Glu Asp Cys Ala Gin 

145 150 155 160 



Ala His Gly Ala Arg Arg Ala Gly Arg Leu Ala Gly Thr Thr Gly Asp 
165 170 175 
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Ala Ala Ala Phe 
180 

Asp Gly Gly Ala 
195 

Arg Arg Leu Arg 
210 

Thr Pro Gly His 
225 

Arg Arg Lys Leu 



Val Ala Arg Arg 
260 

Pro His Thr Val 
275 

Arg His Pro Arg 
290 

lie Glu Leu Asn 
305 

Phe Ala His Leu 



Ser Phe Tyr Pro 



Val Val Thr Ser 
200 

Tyr Tyr Gly Met 
215 

Asn Ala Arg Leu 
230 

Arg Arg Leu Asp 
245 

Tyr Glu Asp Gly 



Pro Gly Asn Glu 
280 

Arg Asp Asp lie 
295 

lie Ser Tyr Pro 
310 

Gly Tyr Gly Lys 
325 



Thr Lys Val Leu 
185 

Arg Asp Asp Thr 



Qlu Glu Arg T^^r 
220 

Asp Glu Val Gin 
235 

Thr Tyr lie Glu 
250 

Leu Gly Asp Thr 
265 

His Val Tyr Tyr 



lie Lys Ala Leu 
300 

Trp Pro Val His 
315 

Gly Ser Leu Pro 
330 



Gly Ala Tyr Gly 
190 

His Arg Ala Leu 
205 

T^yr Val Val Gly 



Ala Glu lie Leu 
240 

Gly Arg Arg Ala 
255 

Gly Leu Val Leu 
270 

Val Tyr Thr Val 
285 

Lys Ala Tyr Asp 



Thr Met Ser Gly 
320 

Val Thr Glu Asp 
335 



Leu Ala Gly Gin He 
340 



<210> 14 
<211> 14806 
<212> DNA 

<213> Streptowyces galilaeus 



<400> 14 

ctcgaggccg tgccggcgca gcagggcgac 
gggaggccgc agcgaggcag ggggccgggc 
ctggtttgcc gtggtggcgt gccggaacca 
cgcggaggcg tcgccggccg tctcctgcgc 
gagagcccgg cgcagggcga tacgcggatc 
ctcggcggtg ccgcccgtcc gcaccgacac 
cgtgaggtcg agggcccaca ccgtcctgcc 
gatccacgga tcccgcgcgt ccagggtcac 
cagggcgacc gcgtcccgtt ccacgagttc 
gctcgttccg gcggcggctc cgttggacgt 
ggcgttgtag tagagcagac tcgtgggcgc 
gacgggggtc cagtcgatcg gggcgtcctc 
accgtgcgcg cggttccacg cccgccggtc 
gctgtccgga tgcacggcga gaccggccag 
gtcgccctgg aagtagccgc tgcaccgctc 
gtggagttcg gtgacgccct tgccggatcc 
cggtgcagcg cggggtccca gacggcagcg 



gsgggccccg gcctgtgccg cggcgtcgcg 6 0 
ggaccggcgc gccgcgggca ggagatacgg 120 
gcgcatggcc tcggggtccg tgccggtgta 18 0 
caacggcggg agcatctggc tgagttcggt 24 0 
gaagtgcgcg ccgaagccca gcacgatgtc 300 
ggcggcgacc acgggaatgc cgagatcgga 360 
cagatcgcgc aggacggccc ggagccgcgt 420 
gccgggctgg cgcgtgcggt tgtaccacca 480 
caggcagccg tgcacgacgg cgtcctccag 54 0 
ggcccggcag aagccagtgt ccgcgtccgg 600 
gagccgctgc cgccgctcgg tcagtgacca 660 
gtcgaagggc tcggtcacct ggtggaaggg 720 
ctcgaactgt ctgcggtcga agagctggac 780 
ctcgcggtaa ctgccgcgcc ggcgcggttc 84 0 
cagcgcctcg gccagcgcgc tgaccctggc 900 
cgggctgcgc agcggggagt ggagcgccgg 96 0 
cgaccgcgtg aagcagttga ggaactccgg 102 0 
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tccgcgcgga tcccggcgga tctcgccgac 
gtagcggtcc agcatctcct gcggaccgaa 
caccggccgg gaggacagca ccacgggtgc 
gccgcagcgg gagcactgcg ggcggcgccg 
ggtgcccagg cgccagaggc tgtcctgcac 
ggcttccagc agggccagrc gcagggccgc 
gcgcacggcc ggaccgcngt gccccagccg 
cagccggagc cggtccgcca gacagctcca 
gccgatccac aggtgggtgc cgctggcccg 
gtgccccgcg tccagggcac tcagccgcgg 
cagagcggcg tgctcgcccg gtgccgcggc 
cagcggggat tccgcgtcac cggtgaggtc 
ggcggaccgc gcctgaagac cggtgaggga 
gtcctgggga taggcgcaca gcagtcccgc 
gagtccggcc ggcagccggg gtgccgcgtc 
gtcgagcagc ggggccagca gggcgatctg 
ggtcatcagg tagacgggct ctcccggccg 
cagccggggg agcccgttgc caggggccgt 
gtggccgacg gggtgctgga aaggggagcc 
tccccgggac cgttgccgaa ctccgtgatg 
gcgtcacctt ccaggagggg cgtgatccgt 
gctgctgtga gtagtgctct ggccctggtt 
ccgtcagggc ggctgccagt gcgtctggca 
acatacgttc ctcatgtgcc tgttcgggtt 
gcaagactcc ctgagcatgg agctcttcgc 
acgtggacat atggttgctg ggaccgctga 
ccaccgcggc gaaaccccga cagatcctcg 
tgcccgtcgg gaccctgatg gaggagatct 
ccaccctgca cacgtacatc ctccagctgc 
aggggggcgt gtccgccaag gacgtcctcg 
cgcccacgga ctccgtggac gtaccgcgct 
ccaccgccga ggaccgccag gaggaggcgt 
ggcgggggtc cgcgctggtg gacgtgcgga 
ggctggagga gagcaggctc ggcgtgctgg 
gacgccacgc ggagctgctg gccgaactca 
^99gcctgca cgcccagtgc atgacggcgc 
tgg^cgtcta ccagaggctg cgccgccggc 
cgcgcctgca gcgtctgcag caggcggtgc 
ggtacggagg ggacccggtg ttcgaccgga 
gttcttggtg acgaactcga cgatggactc 
gcccggatac accccgatcc agaagctctg 
gcccgccacc cggtgcggcg tgccgaggta 
cagccgccgt gtgccgatcc gccgttcctc 
ggtggcgtcg ggcaggaccg tgatcacgaa 
cgcgaccggc agcagcagac cggggacgtc 
gcgccgccgg gccgcgccga actccggcag 
ttgcagatcg gtcgccttca ggttgtaacc 
gcccttgggg agattaccca gctggtagtc 
cggttcgcac cagcagtccc ggccccagtc 
gaggttgcgg gtcagtacgc agcccccctc 
actgaccgtt gccaggtccc cgaaggtccc 
cgcgtcgcag ttgtcctcga tcaggaacag 
cgccacctgg taggggttgc cgagggtgtg 
gatcgccgcc cgcacgtgct ccaccgttgt 
cacgggcgtc agcccgttct ggaggatcgg 
cgtgatcacc tcgtcacccg gacgcagccg 
agagagcgcc agcaggttcg ccgaggaccc 
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aacgccggtg acggggtcca cgagatggcc 1080 
cgtgcggcga ccaccgccgc tctcgtcccg 1140 
cgagacccgg tcgcgtacca gcagcggatc 1200 
cacgggatgg cgactgctcc ccagcgtgcg 1260 
cgtgtcgcgg tgtccggaga gccactctcc 1320 
ggcacgcccc gcgggcaggt aggcccgccg 1380 
gtgctgcacg tacgccccac cgcgccggcg 1440 
gcaggggccg tcgccggcgg agaagaacgg 1500 
gacgggcagc cagccgcgtc ccgtcgcgcg 1560 
gtcgaggtag tcgcgacaga gcaccagggt 1620 
gcgcagcccc gccgcggaca cggcctcggg 1680 
gaccgcggcc aggaccgcgt cccgcgcgtc 174 0 
ccggtaggcc cgttcggggc ggtcggcccc 1800 
gtccagcagc cgggtcacga gccgctcggc 1860 
ggcgacgacg cccggcagat cgcggctgcc 1920 
ctccccgccc agcgtggtca cccggtcctc 1980 
cgactcgacc cgcagatgcg gtgcgaaccc 204 0 
acggctcatc gagacgcagg gccgccgacc 2100 
cagcagatac agatcgcgct gtccgcgtgc 2160 
acgagatcgt cggcgaactc ggcggctgcc 222 0 
gacacgcgtg ctccttgtcg ctgccggccg 2280 
cgagctgtgc actgccgtca tcctcacggc 234 0 
caccgtgcct atgaggttta cacaccttgc 2400 
cagggcactg gttgattgcc gaagtggcca 2460 
ggtgctccgt cgaccagatg ggggagaaga 2520 
cggccgaggt gcggggcagg ccgatcgtgc 2580 
ccctgctcgc catccacgcc aatcgcgtcc 2640 
ggggcaccga gccgccccag agcgccctcg 2700 
gccgccggct gaccgctgcc tacggtgacg 276 0 
tcacccagta cggcggctac tgctggcagg 2820 
acgaacggct cgtcaccgcc ggacggatcg 2880 
cggcccactt ccgtgaggca ctcgcgctct 294 0 
tcggaccggt cctgagcatc gaggtggcgc 3000 
agcgctgcct ggaggcggac ctgaggctgg 3 06 0 
ccgaactcac cgggcgccat ccgctgcacg 3120 
tgtaccgggc gggccgctcc tggcaggcgc 3180 
t^ggcggagga actcggactc tccccgtcgc 324 0 
tctcggcgga gccctggctg gacgcgccca 3300 
tgatcagctg accgtcccgg cggctcagcc 3360 
ccgcatgtag tcggtcatct cctccgtgat 3420 
ctcggtgatg atgtcggagt tgcgcagatc 34 8 0 
cgccggatgg cgggtgaggt tgccgccgaa 354 0 
caggaaggcg accaggtcac ggcgggtgta 3600 
cccgaaccag ctgggatcgc tgcccggtgt 3660 
ggcgagcccg tcccgcagcc gctgccagtt 3720 
cttgttcaac tggctgagtg ccagcgcccc 3780 
gatgtgggag aagatgtact tgtggtcgta 3840 
gaaccgcttg aggcaggtgt tgtcctcgcc 3900 
gcggaacgac tcgacgatgc gggccagttc 3 96 0 
gcccgtggtg atgtggtggg caggatagaa 4 020 
cgtcatacgt ccctggtagg tcgagcccac 4080 
ctcgtgctcg gtggccagtt gctggatctc 4140 
cgcgatcatg atggcccggg tccggtccga 4200 
gttgtacgta ccgagttcca ggtccacgaa 4260 
gttcaccgtg gtggggaatc cgccggccac 4320 
ctgctcaccg agccggggag aggtcagcgc 4 3 80 
cgagttcacc aggtgcgcct tgcgtacgcc 4440 
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gatgtggcgg gcgaatttgc tctcgaagcg ccgggagtgg gctccggccg cgatccgcag 4500 
atccagggcg gcctccacca gggcgacccg gtcctcctcg tccaggacgg cgcccgagga 4 560 
gaggatgggc grcaccccgg gctggaaatt ccctggctgc tgctgccggt gatacccgcg 4620 
aacctgctcc aggaccagag ccttggtgtc cgacgtcata gccgtccctc cgtaaggatt 4680 
gcctcgccga ccatgctggc ccggatcggt cgaggagccg tggagccccg gccgaaggcc 4740 
ctgctccagt gggctncaag gggccttcga cctgccgtgg ctagcgtgtc cggcgacgca 4800 
tcgaactggt gaggtggcag atgccgaagg acactccacg gcccgtactc cgcatcgggg 4860 
ttctgggctg tigccgatatc gcggtgcgcc ggatcctgcc cgcgatcgtg gagcatccgt 4920 
cggtccggct ggtcgctctg gcgagccggg acggggcgcg cgccgaacgg ctcgcggccc 4 980 
gtttcggatg cgcggcggtg accggctaca aggcgctgct ggaccgtgag gacatcaacg 5040 
ccgtctacgt tcccctgccg cccggcatgc accacgaatg ggtcaccgaa gcgctgacgg 5100 
cgggcaagca cgtgctggtg gagaagccgc tcagcacgac gtacgcgcag agcgtcgacc 5160 
tggtggcgat ggccggccgg ctcggcctcg cgctcaccga gaacttcatg ttcctgcacc 5220 
actcgcagca cgaggcggtc cgggccatga ccggcgagat cggggaactg cgggtcttca 5280 
ccagttcctt cggcgtgccg ccgccccacc cctcgtcctt ccggcacgac gcgcggctcg 5340 
gcggcggcgc cccgctggac gtcggtgtct atccgctgcg cgcggcccag ctccacctcg 5400 
ccggggaact cgacgtgctg ggcgcctgtc tgcgcgtgga cgaggcgacc ggcgccgacg 5460 
tcgcgggaag cgcgctgctg tccacggcga cgggtgtgac cgcgcagctc gacttcggct 5520 
tccagcacgc gtaccggtcc gcgtacgcgc tgtggggcag ccgcggcagg ctgagcgtgc 5580 
cgcgggcctt caccccgccc cgtgagcacc gcccggcggt ccgtatcgaa cagcaggacc 5640 
gtctcaccga agtgacgctg cccgccgatc accaggtggg caacgcgctc gacgcgttcg 5700 
ccccggcggt gcactcggag accgtccgtg cctccctggg ggaggcgctg ctgcgtcagg 5760 
cgctcctggt cgagcaggta cgcaaagccg cgcgggtcgt cagcggctga gccccccgga 5820 
cgctttgcgg gcgcctgacg cccgcacgac gagagnnnnn nnnnnnnnnn nnnnngggct 5880 
ctcctcacac tcctcgcggt cgcgccccgc cggggcggct cagacccgct cggccggttc 594 0 
cggccggtgg gtctgccggt cgcggtacca gtcgacgacc tcgcgcagac cgtcgtccag 6 000 
ggaccgcagc ggccggtagc cgagttcttc gcggatcttc gtgtcgtcca ccgcgtaccg 6060 
caggtcgtgg cccttgcggt cgggcacctg gcggaccctg gaccagtcgg tgccgcacag 6120 
cgcgagcagc ttggccgtca tctcacggtc cgtcagatgg gtaccgccgc cgatgttgta 6180 
gatctcgccg ggccggccac cggtcagcac cgcgtggacg gcccggcagc ggtcgtccac 6240 
gtgcagccat cccctgacgt tgcctccgtc gccgtacagc ggcaccggcc ggccggccag 6300 
cagttcggtg acgaacagcg ggatgagctt ctccgggtgc tgacgggggc cgtagttgtt 6360 
ggcgcagcgg gtcgtgcgca catcgaggcc gtgggtgcgc cagtaggacc gtgcgaccag 6420 
gtcgcttccc gccttcgagg ccgcgtaggg ggagttgggg agcagcggcg cctcctccgt 6480 
ccaggtgccc tgcgcgatcg atccgtagac ctcgtccgtg gagacgtgca cgaacgtgcc 6540 
gaccccgccg cgcaggctcg cctccagcag cgactgggtg cccagtacgt tggtgcggac 6600 
gaactccgcc ggctcgctca gcgaacggtc gacatgggtc tcggccgcga agtgtacgac 6660 
cgcgtcgtgg ccggggacca ccgtcagcag cgcgtccagg tcgcggatgt cgagccggcg 6720 
gaagtccagc cggggatcgg ccaggggcag attgccggtg tctcccgcgt acgtgagcag 678 0 
gtcgacgacg gtgacccggc tcgggcgggg accgggcagg gtccccgcca gcagggaacg 6840 
gacgtagtgg gatccgatga aaccggcgcc gccggcgatc aggacacgca tggcgacgta 6900 
cctccggggc gtcggggttc acgggcgcgg tgctgggcgg tcagccgttc cagtacgggg 6960 
acgacctgcg ccggggtcgg ttcggactcg gcctcggccc gcaggcgggc ggccgccgcg 702 0 
gtgaacgagg gctcgtcgag gagccggacg agcctgccgc gcagtccctc caccgtcacc 7080 
tcgtgcgagg ggagatggag gcccgcgccg agtgcctgct gacgctgcgc gcggtcgatc 714 0 
gcgtcccaga tccagcccat ggcgatctgc ggcacaccct cgaccagcgc cgtcgaccag 7200 
gtgccggcgc cgccgtggtg gacgatggcc gcgcaggtcg gcagcagcgc gtgcagcggc 726 0 
acatggtcca ccagacgcac attgtcgggg acatgggtca gcagtgcccg ttcctcggcg 7320 
tcgagggtgg cgaccacctc gatgtccagc ccctcgaccg ccttgaggac gaggtcgacc 7380 
gggacggcgt tggggaactc cgaggtccgg gccgtcatgc ccagggtgac gcagacgcgg 744 0 
gggcgggtcg gcgggacgcg cagccagtcg ggtacgacgg caggccccgg cccgttgtag 7500 
gggacgtagc gcatgcccac gaccgggcgg tccgacggcg ggcggaagct gcgcggcacc 75 6 0 
tggtccaccg accactgtcc cgtcaccgcc tcctcgtcga aggggacgcc gtagcggccg 762 0 
agcgtccact ccagccactg gccgagtgag tcgtcggggt gcgtcgtccc gttccccccg 768 0 
gccgggcccg ccggggcggc gccgagctgg gcgaggtagg cccgccgcat ggacatgaag 774 0 
aggtccggaa aggacagcag ccgggcgtgg gccgcgccgg tcaccttcgc ggccaccgcg 78 0 0 
ccggcgaagg tgaacggctc ccacaggatc aggtccggtt cccagccgcg ggcgaacgcg 786 0 
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acgaggncgt cgaccatcgg atcgtcgttg 
gtgtcccagc cctggagaaa ggccggggtg 
tggtcggcgt ggtgggacag cacagcgtcg 
gggtccgcgc ccaccggcac ggcggtcagc 
ggccggcncg ccacccgcac crcgtgcccg 
gacccgttga agtgagcgtc cagcgcgaag 
ttctctgcgg cccggctttc ggcggctcgc 
gtgccgncac gcggaggtgg tcagggggaa 
catccgccgc aggaccgcgc cggccggcgc 
gaccgcggcc tccgcctgga gccgggcgaa 
gagcgacaac tgcccctgcc cggcgggacg 
cgcctcgggg tcccggttcg ccgcaccgac 
gtcctggccg cccagggtga ggtcctcggc 
cgcgaagcgc agtgcctcct ccacggcgcc 
cgcccactgc acgggacggg tgagcagcga 
gacctcgacg ccgaccacgg cggtcagcac 
ggcggcggag gagccggcgt gcagcaccgc 
ccgcgcctcg acgaggtcgc cgacgagggc 
gcgggtcacg gcgagcggct gggggcagag 
gcacgtcgcc gcgaagcgtt cccgctgggc 
ctccgtggcc accgggcggg agtagtccgc 
ggcggcctcc cggtgcacgc gcccggcgtg 
cgcactgccc aggacggcgg aggcggaccg 
gtggaggaag gcccggtcca gcgggatgat 
ggagaagacg tgccgctgcg ggcccggcag 
ggccagggcc tcggcggcga cgccgtgccg 
ccacagcggc gtcccggctt cccgtatccg 
ggcgcgcagg gtcagggcgt acgggtcgcc 
ggcggtcagg agatgacggc ccagttcgct 
cgcgttctgt gtctgcacgg cagtctcctt 
ccgtcgaccg cgagcaccgc tccggtcgcg 
gcccaccagg cgatgtcctc gggccggccg 
tgcccgagga agcccgcgta ggcctcccgt 
tcgatcacgc cgggcgccag gccgaggaca 
caggtccggg tgaggaagtc gagcccggcc 
caggcccggc ggcccagagc tccggccgag 
gccgacaggg cgtccagggt ctgccgggtg 
aactgctcgc gggcggcctc cggctcggtc 
gcgttgttca cgaggacgtc gatacggccc 
cgcggaccgt ccggatcggt gagatcggcg 
gcggtcccgg ccagggtcgc ggcggtgcgg 
gcgaaggcgt gggccgtggc ccgcccgatc 
cgcggcgctg tcggttctgc tatggcgctc 
ttcacggtcg actcccgctc ggccgcggcc 
ttcccggacc aggagccggc cgccctcccg 
gtgcagccgg gcggggccgc cctgcgcgac 
cagcgagccg tcgtccgcgg gggtgacggt 
gcgggcggcg gccagacccg ccgcggcgtt 
gcgtaccggc tccgggagcg agggcggtgc 
ccaggtctcg gcctcgcccg agtccaggta 
ctcggcgtag acgtccgccg gggccgtcgg 
gccgttcccg cgtcggggac gggtgcgtgc 
cagcgcgact ccaggatcgg tcgagcgggg 
cagcctccaa cgcttctcaa gccgcgctgg 
gccgggaggc gcgtgtgcgc atcatcgacc 
aacctgatcc cgtcgtgcac gacgttctcg 
aggagatgcg tgagcacttc ggcatcgact 
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atcagggtgt agaaggtggc ggtcatcatc 7920 
agctgtccgg gggtgtcggc gtcgagcgac 7980 
cccacgccct tgaccatctc gtccagccgg 8040 
ccggccgccg tgatgcccgc cgtcagcgcg 8100 
gcggcccgca acgcccaggc cagcggcacg 816 0 
gatgtcagga gaacccgcat ggtcggtcct 8220 
aactcgggcc gcggcacgcc gggccccggc 8280 
gcgcagcacc gcgcccagca cgggggaacg 834 0 
gagcacgggc cggcgctccc gcagcgtgcg 8400 
cgccccgaag agcgcggtgt gcgggccgga 8460 
gtcgaggtcg aagtggtccg gggccaggaa 8520 
atggacgacg acctgggcgc cggccgggag 8580 
ggcgatacgg ctctccagcc gcaccggcgg 864 0 
ggccgcgagt tcagggttct cgccgagccg 8700 
ctccagggtg tcgttgatga gcccggcggc 8760 
acccacggcg agcgcgtcct gcccggcgga 8820 
gctgagcagg tcgtcacccg gctgcgtccg 8880 
ccgtacgtcc ccgacggcct cggtgagccg 8940 
cgcggcgtcg agggcgacac cgagggcgag 9000 
ggcggggaca ccgagcagct ccgcggcggc 9060 
catgaggtcg aagctgtccc cggtacggtc 9120 
ctcgcggaca ccctccgccg gggcgccgcc 9180 
cgcccagcgc gtgtggtcgg cgtccgacgc 924 0 
gtggcacagc tgcggattgc tccaggcgtc 9300 
gtcggcgtgg cgcagggcga gccggggatc 936 0 
gccggtgacc caggcgccgg tggtgctctg 9420 
gcgggtcagc agcgcggggt cgtcgctctc 9480 
gctggtgccg tagatccagt ggaagccgcg 9540 
gtcggtctgc tggttctccg ccgtctccgg 9600 
cgggacgtgg ttcaggtgag ggacagaccg 9660 
taagcggcgc gcgggtcggc cagctggacg 9720 
acccggccgg ccgggacgcg ggcggcgatc 978 0 
gacatgcccg agcgttcacc gatgccggtg 984 0 
cggatgccgc ggggtgccag ttcgacggcc 9900 
ttggcggcgc cgtagacacc gttctgcggc 9960 
ccgatgttca gcacggttcc cccgccgtcg 10020 
agcagcagcg gggccaggag gttgctgtcg 10080 
tccgccaggc cggcgaaccc accggtggcc 10140 
aacgcgtcca gcgccgcgtc ggtgatggcg 10200 
gtcaggacac tgatgccggg gtggccctcg 1026 0 
cctacgacga gtacccggtc gccgcggtcg 10320 
ccggtcccgc cgccggtgac gatcacgccc 10380 
atgccccgga ccgtaggccg ggccgctcga 10440 
tcagggccgg tcgtcacggg tcaccacgcg 10500 
gaccaggacg tcccggcagg tgcagaccag 10560 
ggcgacgatc gagaccaggc tctcggcggt 10620 
gagcatgccg acccagtggc ggtgcgtctc 10680 
gcgggcaccc tcggcgagca gcgggcgtcc 10740 
gaaggagccg tcctcggtga aggtgccggc 10800 
ccgcatctgc cgtgcgtaga aatgctgtac 10860 
tgcctgtgct gtcatgccgt gtcacctttc 10920 
ocgggtcccc ggtgagtgtc acgacggtcg 10980 
ggcgggcgga tgcgctggcg gctgttcagt 11040 
tgagcatggg ctggctcgca ggaacccatc 11100 
tgtcctcacc cgtggacgcg gcgggttttg 11160 
gtccgaagga ggccgccacg cacatgagcg 11220 
tcgatccggc ggaactgccg gagggcgaat 11280 
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tcctctcgct cgaccgtctc cagctgacca cccacaccgg aacccatgtc gacgcgccct 11340 
cgcactacgg cacgcgcgcc gcgtaccggg acggtccgcc gcggcacatc gacgagatgc 114 00 
cgctcgacrg gttcttccgg cccgccgtcg tcctcgacct cagcgaccag ggcacgggcg 11460 
cggtcggcgc cgacgcgcrg cggcgggaga tggaccggac cggccacact ccctcgccca 11520 
tggacaccgt cctgctcagg accggtgccg acgcgtgggc gggaaccccg aagtacttca 11580 
cggacttcac cgggctggac ggttcggccg tgcacctgct gctggacctg ggggtgcggg 11640 
tgaccggcac ggacgcgtcc agcctggacg cgccgttcgg cgacaccatc acccgctacc 11700 
gggccacggg cgacccgtcg gtcctgtggc ccgcccatgt catcgggcgg gaccgggagt 11760 
actgccaggt cgagcggctc gccgggctcg accggctgcc cgccgcgcac gggttccggg 11820 
tcgcgtgctt cccggtgcgg atcgccggag cgggcgccgg ctggacgagg gcggtggccc 11880 
tggtcgacga gtgaggagcg cacggcgggc cgagcggacg acgcgcccac cggggcgccg 11940 
acggagggaa cagacatgta cggccgggaa ctcgcggacg tgtacgaggc catctaccgc 12000 
agccgcggca aggactgggg acaggaggcg gcggacgtct cgcggatcat caccgaacgg 1206 0 
cgcccgggag ccggctcgct gctcgacgtc gcctgtggca cgggcgccca tctgagcgtg 1212 0 
ttcagcacgc tgctcgaggt cgccgagggc ctggagatcg cggagccgat gcggcggctc 12180 
gccgagcagc ggctgcccgg caccaccgtg cacgccggcg acatgcgcga cttccggctc 1224 0 
ccgcgcacct acgacgcggt gagctgcatg ttccgcgcca tcggctatct ggagacgctg 12300 
gacgacatgc gggccgccgt ccggtcgatg gccgctcatc cggagcccgg cggcgtgctg 12360 
gtcgtcgaac cctggtggtt ccccgagaac ttcatcgagg gctatgtcgc gggtgacctg 12420 
gcccgcgagg agcaccggac catcgcccgg atctcgcaca ccacccggaa gggccgggcc 1248 0 
acccgcatgg aggttcgctt caccgtgggg gacgccgccg gcatccagca gttcacggag 1254 0 
atcgacgtgc tgaccctgtc caccagggac gaatacaccg ccgcgttcac cgacgccggc 12600 
tgttccgttg aattcctgga ggacggaccc accggccgcg gtcttttcgt cggtgttcgc 12660 
gaacagcgct gagccggagg aaggtcagcc cgagaattct cgggcgaccg ccatcacata 1272 0 
gcgaccgtaa ccggactgcg acatctgctc accgagtcgg cggcacatgt ccgcgtcgat 1278 0 
gaatcccatg cgcagcgcca cttcttcgac gcaggcgatg cgcacgccct gccgttcttc 12840 
gagcgtgcgc acgtactggg tggcctgaag cagcgattcc ggggttccgg cgtcgagcca 12900 
ggcgaaaccg cggccgagat cgaccagacg cgctcggccg cgggccaggt aattccggtt 12960 
cacatcggtg atctccagtt cgccgcgcgg cgagggccgg aggttcttcg cgatgtccac 13020 
cacttcgttg tcgtacagat agagcccggt gatggcgagg tcggaacgcg gccggagcgg 13080 
cttctcctcc agggagacga gttgccccga cgcgtcggtc tcgcccacgc cgtagcgctc 13140 
cgggtcctcc acggggtagc cgaacagcac gcagccctgg acgtcgtgga cgttgctctg 13200 
gagcaggtcg tagaagtggt gtccgtggaa gatgttgtcc ccgaggacca gtgccacgtc 13260 
gtcgtcaccc acgtggtcgg cgccgatgac gaaggcgtcg gccagtccgg ccggccggtt 13320 
ctgcaccgcg tagtcgatgc gcaggccgag ctgggagccg tcgcccagga gccggcggaa 13380 
ctgttccaga tcgcgctcgg tgcagatgag caggatctcc ctgatgtccg ccagcatcag 1344 0 
caccgagagc ggatagtaga tcatcggctt gtcgccgacc ggaagaagct gttttgatac 13500 
ggagacggtt atcggatgga gccgtgttcc cgaaccgccg gcgagaataa tccccttcat 1356 0 
gggtatgccc ctgtcctcga tttcttctca tgctaacgac cggattcgct cggcagcaag 1362 0 
ccggaaaatt tctggtccag ggaaaatcga gcggtaatag agggaatttc ggggttgtgg 1368 0 
accggagcga gaatgtgatg tgctgcgagg ggatggtgac gccctgtggc acggcatcgg 13740 
cgtggcatcg gcaaggcatc ggcaagggga ggaaaagaat taatgacgac tctcgtctgg 13800 
gactacctac aggaatacga gaacgaacgc gccgacattc tggacgccgt ggagacggtc 13860 
ttcagctcgg gtcggctcgt cctcggtgac agcgttcgcg gattcgagga ggagttcgcc 13920 
gcgtaccacg gcgcggcgca ctgcgtcggt gtcgacaacg gcaccaacgc gatcaagctg 13980 
gccctccagg cgctcggcgt cggccccggt gacgaggtcg tcacggtgtc caacaccgcg 14 04 0 
gcccccacgg tcgtggccat cgattcggtg ggcgccaccc cggtgttcgt cgacgtccac 14100 
ccggacagct acctcatgga caccgagcag gtggaggccg cactcacgcc ccggacccga 1416 0 
tgcctgctgc cggtccatct ctacgggcag tgcgtcgacc tggctccgct ggagcggctc 14220 
gccgcggagc acgacctgtt cctcgtcgag gactgcgcac aggcccacgg tgcccgccgc 1428 0 
gccggccggc tcgccggcac caccggcgac gccgccgcct tctccttcta ccccacgaag 14340 
gtgctcggcg cctacggcga cggaggcgcc gtggtgacct cccgggacga cacccaccgc 144 00 
gcgctgcgcc gactgcgcta ctacggcatg gaggagcggt actacgtcgt cggcaccccg 14460 
ggtcacaacg cccggctcga cgaggtccag gccgagatcc tgcggcgcaa gctgcgccgg 14520 
ctcgacacct acatcgaggg gcggcgggcc gtcgcccggc gctacgagga cgggctcggc 14580 
gacaccggcc tggtgctccc gcacaccgtc cccggcaacg agcacgtgta ctacgtctac 14 64 0 
acggtgcgcc acccgcggcg cgacgacatc atcaaggccc tcaaggcgta cgacatcgag 14700 
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ctgaacacca gctatccctg gcccgtgcac accatgnccg ggttcgccca cctcggctac 14760 
ggcaagggct cgctgcccgt caccgaggac ctggccggcc agatct 14806 



<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerated 
oligonucleotide primer 

<400> 15 

csggsgssgc sggsttcats gg 22 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerated 
oligonucleotide primer 



<400> 16 

gggwrctggy rsggsccgta gttg 
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